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CoMPOUNDS Having Affinity for the Granulocyte-Colony 
Stimulating Factor Receptor (G-CSFR) and Associated Uses 



Technical Field 

5 The present invention relates generally to novel compounds that have affinity 

for the granulocyte-colony stimulating factor receptor (G-CSFR). More particularly, the 
invention relates to such compounds which act as G-CSF mimetics by activating or 
inactivating the G-CSFR, or by affecting ligand binding to G-CSFR. The invention 
additionally relates to methods of using the novel compounds and pharmaceutical 
10 compositions containing a compound of the invention as the active agent. The invention 
has application in the fields of biochemistry and medicinal chemistry and particularly 
provides G-CSF mimetics for use in the treatment of human disease. 



15 
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Background 

Granulocyte-colony stimulating factor (G-CSF) is a hematopoietic growth factor 
that specifically stimulates proliferation and differentiation of cells of the neutrophilic 
lineage. 

5 G-CSF is a cytokine that binds to and activates the granulocyte-colony 

stimulating factor receptor (G-CSFR). G-CSFR is expressed on the surface of mature 
neutrophils and cells committed to the neutrophilic lineage, with receptor density varying 
from 190 to more than 1400 sites per cell. The receptor is a member of the cytokine 
receptor superfamily; it contains a cytokine receptor-homologous domain responsible for 

10 G-CSF binding, an immunoglobulin-like domain, three fibronectin type III domains, a 
transmembrane region, and an intracellular domain. The observed affinity of G-CSF for 
its receptor is about 1 00 pM. 

The complete G-CSF protein has become an important therapeutic agent in 
clinical indications involving depressed neutrophil counts. Such indications include 

15 chemotherapy-induced neutropenia, AIDS and community acquired pneumonia. 

Furthermore, G-CSF antagonists may be useful in the treatment of some diseases caused 
by an inappropriate or undesirable activation of G-CSFR. 

There remains a need, however, for compounds that bind specifically to G- 
CSFR, both for studies of the important biological activities mediated by the receptor and 

20 for treatment of diseases, disorders and conditions that would benefit from activating or 
inactivating G-CSFR. The present invention provides such compounds, and also provides 
pharmaceutical compositions and methods for using the compounds as therapeutic agents. 



Summary of the Invention 

25 In one embodiment, the invention provides compounds comprising a peptide 

chain that binds to G-CSFR. In one aspect, the peptide chain is approximately 10 to 40 
amino acids in length and contains a sequence of amino acids of formula (I) 
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(I) 0X1X2X3X4X5X6X7X80 (SEQ ID NO: 1 ) 
wherein each amino acid is indicated by standard one-letter abbreviation, and wherein Xj 
is A, N, S, F, D, G, L, T, E, V, P, Q, H, M or K; X2 is M, G, R, H, D, I, V, A, S, E, N, F, 
Y, P, C, W or T; X3 is E, V, W, F, M, A, N, S, L, T, Y, G or P; X4 is V, I, G, Q, W, M, T, 
5 Y, L, P, D, 0, E or A; X5 isM, E, W, L, P,N, I, T, V, F, Y, Q, S, R, W, G, H or D; X^ is 
H, A, W, Y, V, F, Q, M, N, E, S, D, P or G; X7 is M, F, Y, V, N, L, H, D, S, W, G, Q, C or 
T; and Xg is C, Y, R, I, K, W, L, E, M, H, A, T, F, D, P, G or Q. 

In another aspect, the peptide chain is approximately 9 to 40 amino acids in 
length and contains a sequence of amino acids of formula (II) 
10 (II) X\X'2X'3SGWVWX^4 (SEQ ID NO: 2) 

wherein each amino acid is indicated by the standard one-letter abbreviation, and wherein 
X', is S, Q, R, L or Y; X'2 is N, S, T, A or D; X'3 is E, D or N; and X'4 is L V, T, P or H. 

In another aspect, the peptide chain is 6 to 40 amino acids in length and contains 
a sequence of amino acids of formula (III) 
15 (III) ERX",X"2X"30 (SEQ ID NO: 3) 

wherein each amino acid is indicated by standard one-letter abbreviation, and wherein X", 
is D, L, S, G, E, A, K or Y; X^ is W, Y, F, L or V; and X\ is F, G, M or L. 

In still another aspect, the peptide chain is approximately 9 to 40 amino acids in 
length and contains a sequence of amino acids of formula (IV) 
20 (IV) X™ MVYX'"2X'"3PX™4W (SEQ ID NO: 4) 

wherein each amino acid in indicated by standard one-letter abbreviation, and wherein X™i 
is D or E; X"'2 is A or T; X"'3 is Y or V; and X"'4 is P or Y. 

In an additional aspect, the invention provides compounds comprising a peptide 
chain approximately 12 to 40 amino acids in length and contains a sequence of amino 
25 acids of formula (V) 

(V) CX'^,X'^2X'^X^X^^5X'''6X^X'^8X%X^^,oC (SEQ ID NO: 5) 
wherein each amino acid is indicated by standard one-letter abbreviation, and wherein X'^, 
is E, G, P, N, R, T, W, S, L, H, A, Q or Y; X'^ is S, T, E, A, D, G, W, P, L, N, V, Y, R or 
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M; is R, Y, V, Q, E, T, L, P, S, K, M, A or W; X'^ is L, M, G, ¥, W, R, S, V, P, A, 
D, C or T; X'^^ is V, T, A, R, S, L, W, C, I, E, P, H, F, D or Q; X'\ is E, Y, G, T, Q, M, S, 
N, A or P; X^\ is C, V, D, G, L, W, E, V, I, S, M or A; X'\ is S, Y, A, W, P, V, L, Q, G, 
K, F, I, E or D; X% is R, W, M, D, H, V, G, A, Q, L, S, E or Y; X'^o is M, L, I, S, V, P, 
5 W, F, T, Y, R, or Q. 

In another aspect the peptide chain is approximately 9 to 40 amino acids in 
length and contains a sequence of amino acids of formula (VI) 

(VI) X^,X^2X''3X\X^5X\CX\X^8 (SEQ ID NO: 6) 

wherein each amino acid is indicated by standard one-letter abbreviation, and wherein X^j 
10 is E, C, Q, V, or Y; X^2 is E, A, L, M, S, W, or Q; X^ is K, R or T; X\ is L, A, or V; X^5 
is R, A, M, H, E, V, L, G, D, Q, or S; X^g is E or V; X^ is A or G; X\ is R, H, G or L. 

In a further aspect, the peptide chain is approximately 10 to 40 amino acids in 
length that binds to G-CSFR and contains a sequence of amino acids of formula (VII) 

(VII) X^iX^i^X^'sX^X^^jEX^fiX^X^^sX^ (SEQ ID NO: 7) 

15 wherein each amino acid is indicated by standard one-letter abbreviation, and wherein X^', 
is A, E or G; X''^ is E, H or D; X^'3 is R or G; X''^4 is K, Y, M, N, Q, R, D, I, S or E; X^'j 
is A, S or P; X^'^ is E, D, T, Q, K or A: X% is R, W, K, L, S, A or Q; is R or E; and 
X% is W, G,orR. 

In a final aspect, the invention also provides peptides that, while not necessarily 
20 corresponding to one of the above-defined formulas, bind to G-CSFR. 

In some contexts, the compounds of the invention are preferably in the form of a 
dimer. It is also preferred, in some contexts, that the compounds of the invention include 
a peptide wherein the N-terminus of the peptide is coupled to a polyethylene glycol 
molecule. In some contexts, it is preferred that the compounds of the invention include a 
25 peptide wherein the N-terminus of the peptide is acetylated. In addition, it is preferred, in 
some contexts, that the compoimds of the invention include a peptide wherein the C- 
terminus of the peptide is amidated. 
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The invention also provides a pharmaceutical composition that comprises a 
therapeutically effective amount of a compound of the invention in combination with a 
pharmaceutically acceptable carrier, as well as a method for treating a patient who would 
benefit from a G-CSFR modulator, the method comprising administering to the patient a 
5 therapeutically effective amount of a compound of the present invention. 




Brief Description of the Drawings 

^ %ures 1-1, 1-2, 1-3, 1-4, 1-5, 1-6, 1-7, 1-8, 1-9, 1-10 and 1-11 provide the 
sequences of F^resentative peptide chains contained within the compounds of the 
10 invention. 

Figures 2, 3, 4, 5, 6, 7, 8, 9A, 9B lOA, lOB and 1 1 are graphs showing the 
results of various assays described in Examples. 



Detailed Description of the Invention 

15 

I. Definitions and Overview 

It is to be understood that unless otherwise indicated, this invention is not 
limited to specific peptide sequences, molecular structures, pharmaceutical compositions, 
or the like, as such may vary. It is also to be understood that the terminology used herein 
20 is for the purpose of describing particular embodiments only and is not intended to be 
limiting. 

It must be noted that, as used in the specification and the appended claims, the 
singular forms "a," "an" and "the" include plural referents unless the context clearly 
dictates otherwise. Thus, for example, reference to "a novel compound" in a 
25 pharmaceutical composition means that more than one of the novel compounds can be 
present in the composition, reference to "a pharmaceutically acceptable carrier" includes 
combinations of such carriers, and the like. 
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In this specification and in the claims that follow, reference will be made to a 
number of terms which shall be defined to have the following meanings: 

Amino acid residues in peptides are abbreviated as follows: Phenylalanine 
is Phe or F; Leucine is Leu or L; Isoleucine is He or I; Methionine is Met or M; Valine is 
5 Val or V; Serine is Ser or S; Proline is Pro or P; Threonine is Thr or T; Alanine is Ala or 
A; Tyrosine is Tyr or Y; Histidine is His or H; Glutamine is Gin or Q; Asparagine is Asn 
or N; Lysine is Lys or K; Aspartic Acid is Asp or D; Glutamic Acid is Glu or E; Cysteine 
is Cys or C; Tryptophan is Trp or W; Arginine is Arg or R; and Glycine is Gly or G. In 
addition, " l-Nal" is used to refer to 1-naphthylalanine, the "2-Nar' is used to refer to 
10 2-naphthylalanine. 

Stereoisomers (e.g., D-amino acids) of the twenty conventional amino acids, 
unnatural amino acids such as a,a-disubstituted amino acids, N-alkyl amino acids, lactic 
acid, and other unconventional amino acids may also be suitable components for 
compounds of the present invention. Examples of unconventional amino acids include: 
15 p-alanine, 1-naphthylalanine, 2-naphthylalanine, 3-pyridylalanine, 4-hydroxyproline, 
0-phosphoserine, N-acetylserine, N-formylmethionine, 3-methylhistidine, 
5 -hydroxy lysine, nor-leucine, and other similar amino acids and imino acids (e.g., 
4-hydroxyproHne). 

"Peptide" or "polypeptide" refers to a polymer in which the monomers are alpha 
20 amino acids joined together through amide bonds. Peptides are two or often more amino 
acid monomers long. One or more of the peptide chains disclosed herein may appear in 
the compounds of the present. It is also contemplated that the peptide chains disclosed 
herein represent only a portion of the overall peptide included in the compound. 

The term "dimer" as in a peptide '*dimer" refers to a compound in which two 
25 peptide chains are linked; generally, although not necessarily, the two peptide chains will 
be identical and are linked through a linking moiety covalently bound to the carboxyl 
terminus of each chain. 
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The term "agonist" is used herein to refer to a ligand that binds to a receptor and 
activates the receptor. 

The term "antagonist" is used herein to refer to a ligand that binds to a receptor 
without activating the receptor. Antagonists are either competitive antagonists or 
5 noncompetitive antagonists. A "competitive antagonist" blocks the receptor site that is 
specific for the agonist. A "noncompetitive antagonist" inactivates or otherwise affects 
the functioning of the receptor by interacting with a site other than the agonist binding 
site. 

The term "modulator" as in a "G-CSFR-modulator" refers to a compound that is 
10 either an agonist or antagonist of the G-CSFR. 

"Pharmaceutically or therapeutically effective dose or amount" refers to a 
dosage level sufficient to induce a desired biological result. That result can be alleviation 
of the signs, symptoms, or causes of a disease, or any other desired alteration of a 
biological system. Preferably, this dose or amount will be sufficient to either at least 
15 partially activate or at least partially inactivate G-CSFR and, thus, alleviate the symptoms 
associated with an undesired neutrophil count in vivo. 

An "optimal neutrophil count" refers to a quantity of neutrophils in a patient that 
is determined by a clinician to be optimal for that patient in light of the patient's disease 
state, condition, etc. 

20 An "undesired neutrophil count" refers to a quantity of neutrophils in a patient 

that is determined by a clinician to be not optimal for that patient in light of the patient's 
disease state, condition, etc. Thus, an undesired neutrophil count may be depressed, 
elevated or even equal to the expected neutrophil count so long as the clinician determines 
that the actual count is not optimal for the patient. The compounds of the present 

25 invention are intended to, inter alia, provide the clinician with compounds that, when 
administered to a patient, bring that patient's neutrophil count closer to an optimal 
neutrophil count. 
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The term "treat" as in "treat a disease" is intended to include any means of 
treating a disease in a mammal, including (1) preventing the disease, i.e., avoiding any 
clinical symptoms of the disease, (2) inhibiting the disease, that is, arresting the 
development or progression of clinical symptoms, and/or (3) relieving the disease, i.e., 
5 causing regression of clinical symptoms. 

"Optional" or "optionally" means that the subsequently described circimistance 
may or may not occur, so that the description includes instances where the circumstance 
occurs and instances where it does not. 

By "pharmaceutically acceptable carrier" is meant a material which is not 
10 biologically or otherwise undesirable, i.e., the material may be administered to an 

individual along with the selected active agent without causing any undesirable biological 
effects or interacting in a deleterious manner with any of the other components of the 
pharmaceutical composition in which it is contained. 

15 11. The Compounds 

A. Compounds of Formula (I): 

In a first embodiment, the invention provides compounds comprising a peptide 
chain that binds to G-CSFR, wherein the compounds comprise a peptide chain 

20 approximately 10 to 40 amino acids in length that binds to G-CSFR and contains a 
sequence of amino acids of formula (I) 

(I) CXiX2X3X4X5X6X7X8C(SEQIDNO: 1) 
wherein each amino acid is indicated by standard one-letter abbreviation, and wherein Xj 
is A, N, S, F, D, G, L, T, E, V, P, Q, H, M or K; X2 is M, G, R, H, D, I, V, A, S, E, N, F, 

25 Y, P, C, W or T; X3 is E, V, W, F, M, A, N, S, L, T, Y, G or P; X4 is V, I, G, Q, W, M, T, 
Y, L, P, D, C, E or A; X5 is M, E, W, L, P,N, I, T, V, F, Y, Q, S, R, W, G, H or D; X^ is 
H, A, W, Y, V, F, Q, M, N, E, S, D, P or G; X^ is M, F, Y, V, N, L, H, D, S, W, G, Q, C or 
T; and Xg is C, Y, R, I, K, W, L, E, M, H, A, T, F, D, P, G or Q. 
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Preferably X, is D or P; is D or P; X3 is E or W; X4 is V, I or Y; X5 is M or 
L; Xe is W, Y or F; Xj is M, Y or D; and Xg is C or M. 

Examples of particularly preferred sequences satisfying formula (I) include, but are not 
limited to, the following: 
5 CAGEVMHMCC (SEQ ID NO: 8); 

CNREIEAMCC (SEQ ID NO: 9); 

CADEVMHFCC (SEQ ID NO: 1 0); 

CNREIMWMCC (SEQ ID NO: 1 1); 

CSHEVWWYCC (SEQ ID NO: 12); 
"5 10 CSREVLYYCC(SEQIDNO: 13); 

m CFIEGPWVCC (SEQ ID NO: 14); 

p CFVEGNWYCC (SEQ ID NO: 1 5); 

;;p; CAAEVMVNCC(SEQIDNO: 16); 

CSDEVIF YCC (SEQ ID NO: 1 7); 
h 15 CDREIMWFCC(SEQIDNO: 18); 

«1 CAHEVMWMCC(SEQIDNO: 19); 

;3 CGSEVTFMCC(SEQIDNO:20); 
5 CLEEIMWLCC(SEQIDNO:21); 

CAREVLAMCC (SEQ ID NO: 22); 
20 CSVEVMQMCC (SEQ ID NO: 23); 

CTNVQLMHYC (SEQ ID NO: 24); 

CDVWQLFDRC (SEQ ID NO: 25); 

CSFVQLNSIC (SEQ ID NO: 26); 

CDYWQWFDKC (SEQ ID NO: 27); 
25 CESFWVELWC (SEQ ID NO: 28); 

CVPWMFYDLC (SEQ ID NO: 29); 

CDPWMFYDLC (SEQ ID NO: 30); 

CDPWVLFDEC (SEQ ID NO: 31); 
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CDHWTYFDMC (SEQ ID NO: 32); 
CVVWTLYDKC (SEQ ID NO: 33); 
CPDWYQSYMC (SEQ ID NO: 34); 
CPDWYSYYMC (SEQ ID NO: 35); 
CPEWYTDVMC (SEQ ID NO: 36); 
CPDWYLDYMC (SEQ ID NO: 37); 
CPEWYLDYMC (SEQ ID NO: 38); 
CPDWYLPYMC (SEQ ID NO: 39); 
CPEWYLPYMC (SEQ ID NO: 40); 
CQDWWVELWC (SEQ ID NO: 41); 
CPDWYLPWMC (SEQ ID NO: 42); 
GACMLRVVHC (SEQ ID NO: 43); 
CQRAGYMLAC (SEQ ID NO: 44); 
CHANPVWGEC (SEQ ID NO: 45); 
CFWSDWGQTC (SEQ ID NO: 46); 
CPHWTSYYMC (SEQ ID NO: 47); 
CETLCGACFC (SEQ ID NO: 48); 
CATTINDTLC (SEQ ID NO: 49); 
CLNYPHPVFC (SEQ ID NO: 50); 
CMDGEMAVDC (SEQ ID NO: 51); 
CNMGWMSWPC (SEQ ID NO: 52) 
CETYADWLGC (SEQ ID NO: 53); 
CDPWMFFDMC (SEQ ID NO: 54); 
CDPWIWYDLC (SEQ ID NO: 55); 
CDPWIMYDRC (SEQ ID NO: 56); 
CDPWVFFDIC (SEQ ID NO: 57); 
CDPWTYYDLC (SEQ ID NO: 58); 
CDPWIFYDRC (SEQ ID NO: 59); 
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CDPWLFYDLC (SEQ ID NO: 60); 
CDPWVWYDLC (SEQ ID NO: 61); 
CDPWIFFDRC (SEQ ID NO: 62); 
CDPWMFFDQC (SEQ ID NO: 63); 
5 CDPWLWYDRC (SEQ ID NO: 64); 

CDVWVWYDQC (SEQ ID NO: 65); 
CDPWIYYDLC (SEQ ID NO: 66); 
CVPWTLFDLC (SEQ ID NO: 67); 
CPAWYLEYMC (SEQ ID NO: 68); 
10 CPDWYLEYMC (SEQ ID NO: 69); 

CKYWQWFDKC (SEQ ID NO: 70); and 
CDHWMWYDKC (SEQ ID NO: 71). 

Other preferred formula (I) sequences include, but are not limited to the following: 
15 GCNREIEAMCCG (SEQ ID NO: 72); 

GCPEWYTDVMCG (SEQ ID NO: 73); 

NWYCMDGEMAVDCEAT (SEQ ID NO: 74); 

WQSCNMGWMSWPCYFV (SEQ ID NO: 75); 

HELCETYADWLGCVEW (SEQ ID NO: 76); 
20 PCDPWMFFDMCERW (SEQ ID NO: 77); 

LRGCDPWIWYDLCPAV (SEQ ID NO: 78); 

GYLCDPWIFYDRCLGF (SEQ ID NO: 79); 

RFACDPWVFFDICGYW (SEQ ID NO: 80); 

GYWCDPWTYYDLCLTA (SEQ ID NO: 81); 
25 MWTCDPWIFYDRCFLN (SEQ ID NO: 82); 

GSSCDPWLFYDLCLLD (SEQ ID NO: 83); 

GGGCDPWVWYDLCWCD (SEQ ID NO: 84); 

YTSCDPWIFFDRCMSV (SEQ ID NO: 85); 
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DPYCDPWMFFDQCAYL (SEQ ID NO: 86); 
REFCDPWLWYDRCL (SEQ ID NO: 87); 
NTGCDVWVWYDQCFAM (SEQ ID NO: 88); 
LVFCDPWIYYDLCMDT (SEQ ID NO: 89); 
5 GCSFVQLNSICG (SEQ ID NO: 90); 

GCPAWYLEYMCG (SEQ ID NO: 91); 
GCPDWYLEYMCG (SEQ ID NO: 92); 
GCKYWQWFDKCG (SEQ ID NO: 93); and 
GCDHWMWYDKCG (SEQ ID NO: 94). 

10 

B. COMPOUNDS OF Formula (II): 

In another aspect, compounds are provided comprising a peptide chain 
approximately 9 to 40 amino acids in length that binds to G-CSFR and contains a 
sequence of amino acids of formula (II) 
15 (II) X',X'2X'3SGWVWXi4 (SEQ ID NO: 2) 

wherein each amino acid is indicated by the standard one-letter abbreviation, and wherein 
X', is S, Q, R, L or Y; X'^ is N, S, T, A or D; X'3 is E, D or N; and X'4 is L V, T, P or H. 

Preferably X\ is S or Q; X\ is S; X'3 is N; and X'4 is V. 

Examples of particularly preferred sequences satisfying formula (II) include, but 
20 are not limited to, the following: 

SNESGWVWL (SEQ ID NO: 95); 

QSNSGWVWV (SEQ ID NO: 96); 

RTESGWVWT (SEQ ID NO: 97); 

RANSGWVWV (SEQ ID NO: 98); 
25 YDNSGWVWH (SEQ ID NO: 99); and 

LSDSGWVWVP (SEQ ID NO: 100). 
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Other preferred formula (II) sequences include, but are not limited to, the following: 

EQSNSGWVWVGGGGC (SEQ ID NO: 101); 

CEQSNSGWVWV (SEQ ID NO: 102); 

EQSNSGWVWVGGGGCKKK (SEQ ID NO: 103); 
5 EQSNSGWVWVGKKKC (SEQ ID NO: 104); 

EQSNSGWVWVGKKK (SEQ ID NO: 105); 

KKKEQSNSGWVWV (SEQ ID NO: 106); 

EQSNSGWVWVGKKKSKKK (SEQ ID NO: 107); 

EQSNSGWVWVGGCKKK (SEQ ID NO: 1 08); 
10 EQSNSGWVWVGGGGGGCKKK (SEQ ID NO: 1 09); 

SNESGWVWLP (SEQ ID NO: 1 10); 

EQSNSGWVWV (SEQ ID NO: 1 1 1); 

SRTESGWVWT (SEQ ID NO: 1 1 2); 

QRANSGWVWV (SEQ ID NO: 1 13); 
15 DYDNSGWVWH(SEQIDNO: 114); 

EQSNSGWVWVGKKXK (SEQ ID NO: 1 1 5); 

EQSNSGWVWVGGGGSKKK (SEQ ID NO: 116); 

EQSNSGWVWVGGGGS (SEQ ID NO: 1 17); 

EQSNSGWVWVGGGGSEQSNSGWVWVGGGGS (SEQ ID NO: 1 1 8); 
20 RYQSFELSDSGWVWVPVARH (SEQ ID NO: 1 19); and 

EQSNSGWVWVGGGGCKKKC (SEQ ID NO: 492). 



C. Compounds of Formula (III): 

In another aspect, the invention provides compounds comprising a peptide 
25 chain approximately 6 to 40 amino acids in length that binds to G-CSFR and contains a 
sequence of amino acids of formula (III) 

(III) ERX"iX"2X"3C (SEQ ID NO: 3) 
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wherein each amino acid is indicated by standard one-letter abbreviation, and wherein X"i 
is D, L, S, G, E, A, K or Y; X\ is W, Y, F, L or V; and is F, G, M or L. 
Preferably, X", is D or L; X"2 is W; and X"3 is F. 

5 Examples of particularly preferred sequences satisfying formula (III) include, 

but are not limited to, the following: 

ERDWFC (SEQ ID NO: 120); 

ERDWGC (SEQ ID NO: 121); 

ERLWFC (SEQ ID NO: 122); 
10 ERSYFC (SEQ ID NO: 123); 

ERGWFC (SEQ ID NO: 1 24); 

EREWFC (SEQ ID NO: 125); 

ERAWFC (SEQ ID NO: 126); 

ERLYFC (SEQ ID NO: 127); 
15 ERYFMC (SEQ ID NO: 128); 

ERLFLC (SEQ ID NO: 1 29); 

ERALMC (SEQ ID NO: 130); 

ERDVMC (SEQ ID NO: 1 3 1); and 

ERKWFC (SEQ ID NO: 132). 

20 

Particulary preferred compounds are of the formula: 
ETWGERDWFC (SEQ ID NO: 133); 
ETWGERDWGC (SEQ ID NO: 1 34); 
STAERLWFCG (SEQ ID NO: 1 35); 
25 YETAERSYFC (SEQ ID NO: 136); 

ADNAERGWFC (SEQ ID NO: 137); 
QSNSEREWFC (SEQ ID NO: 138); 
STSERAWFCG (SEQ ID NO: 139); 
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ASWSERGWFC (SEQ ID NO: 140); 
ELSSEREWFC (SEQ ID NO: 141); 
DMQGERGWFC (SEQ ID NO: 142); 
SSSERAWFCG (SEQ ID NO: 143); 
5 GNMRERLYFC (SEQ ID NO: 144); 

QPNRERYFMC (SEQ ID NO: 145); 
SVTRERLFLC (SEQ ID NO: 146); 
IPLSERALMCSSWNC (SEQ IDNO: 147); 
WARSERDVMCLSYVC (SEQ ID NO: 148); 
10 QSNSEREWFCG (SEQ ID NO: 149); 

QSNSEREWFCGGGGS (SEQ ID NO: 1 50); 
NLEEALAQERLWFCRSGNC (SEQ ID NO: 151); and 
NLESYEMEERKWFCKMFSC (SEQ ID NO: 152). 

15 D. Compounds OF Formula (IV): 

In another aspect, compounds are provided comprising a peptide chain 
approximately 9 to 40 amino acids in length that binds to G-CSFR and contains a 
sequence of amino acids of formula (IV): 

(IV) X"',MVYX"'2X'"3PX"4W(SEQIDNO:4) 
20 wherein each amino acid in indicated by standard one-letter abbreviation, and wherein X"\ 
is D or E; X™2 is A or T; X"'3 is Y or V; and X"'4 is P or Y. 

Examples of particularly preferred sequences satisfying formula (IV) include, 
but are not limited to, the following: 

DMVYAYPPW (SEQ ID NO: 153); and 
25 EMVYTVPYW (SEQ ID NO: 1 54). 
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Other preferred formula (IV) sequences include, but are not limited to, the following: 
DMVYAYPPWS (SEQ ID NO: 155); and 
DEMVYTVPYW (SEQ ID NO: 156). 

5 E. Compounds OF Formula (V): 

In another aspect, compounds are provided comprising a peptide chain 
approximately 12 to 40 amino acids in length that binds to G-CSFR and contains a 
sequence of amino acids of formula (V): 

(V) CX'^iX'^2X'^3X'\Xi^5X'V'^X'^3X'\X'^,oC(SEQIDNO:5) 
10 wherein each amino acid is indicated by standard one-letter abbreviation, and wherein X'^^j 
is E, G, P, N, R, T, W, S, L, H, A, Q or Y; X'^2 is S, T, E, A, D, G, W, P, L, N, V, Y, R or 
M; X'\ is R, Y, V, Q, E, T, L, P, S, K, M, A or W; X'\ is L, M, G, F, W, R, S, V, P, A, 
D, C or T; X'^5 is V, T, A, R, S, L, W, C, I, E, P, H, F, D or Q; X'^^ is E, Y, G, T, Q, M, S, 
N, A or P; X'^ is C, V, D, G, L, W, E, V, I, S, M or A; X^\ is S, Y, A, W, P, V, L, Q, G, 
15 K, F, I, E or D; X% is R, W, M, D, H, V, G, A, Q, L, S, E or Y; X'^io is M, L, I, S, V, P, 
W,F,T,Y,R,orQ. 

Preferably X'^i is E; X'^2 is S or A; X'^3 is R; X'^ is L; X'^5 is V or S; X'^^ is 
E; is C; X'^g is S; X'\ is R; and X'^,o is L. 

Examples of particularly preferred sequences satisfying formula (V) include, but 
20 are not limited to, the following: 

CESRLVECSRMC (SEQ ID NO: 157); 
CETYMTYVYWLC (SEQ ID NO: 158); 
CGERLAECARLC (SEQ ID NO: 159); 
CESRLRECSMLC (SEQ ID NO: 160); 
25 CEARLSECSRIC (SEQ ID NO: 1 61 ); 

CPARLLECSRMC (SEQ ID NO: 162); 
CESVGVGDWWSC (SEQ ID NO: 163); 
CEDRLVEGPWVC (SEQ IDNO: 164); 
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CNDQFRTCVDVC (SEQ ID NO: 165); 

CRGEWWELYHPC (SEQ ID NO: 166); 

CEDTRTGWAWSC (SEQ ID NO: 167); 

CTWLSSGELVWC (SEQ ID NO: 1 68); 
5 CWPPVCEVSGIC(SEQIDNO: 169); 

CSLSPIQLQHLC (SEQ ID NO: 170); 

CLARLEECSRFC (SEQ ID NO: 171); 

CHNSSPMVGVTC (SEQ ID NO: 172); 

CHVSPVQIKALC (SEQ ID NO: 173); 
10 CAAPATSWFQYC (SEQ ID NO: 174); 

CASKLHECSLRC (SEQ ID NO: 175); 

CEPMDSNGIVQC (SEQ ID NO: 1 76); 

CQYASAADEQRC (SEQ ID NO: 177); 

CEYWDEPSLSWC (SEQ ID NO: 178); 
15 CERECFQMLERC (SEQ ID NO: 1 79); 

CGMSTDELDEIC (SEQ ID NO: 180); 

CYVSPSTGLYSC (SEQ ID NO: 1 8 1 ); 

CEARLVECSRLC (SEQ ID NO: 1 82); 

CESRLSECSRMC (SEQ ID NO: 1 83); 
20 CELKLQECARRC (SEQ ID NO: 1 84); 

CELKLQEAARRC (SEQ ID NO: 185); and 

CLERLEECSRFC (SEQ ID NO: 1 86). 

Other preferred formula (V) sequences include but are not limited to, the following: 
25 GGCESRLVECSRMC (SEQ ID NO: 1 87); 

GGCETYMTYVYWLC (SEQ ID NO: 188); 
EWLCESVGVGDWWSC (SEQ ID NO: 189); 
YHPCEDRLVEGPWVCCRS (SEQ ID NO: 190); 
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WLLCNDQFRTCVDVCDNV (SEQ ID NO: 191); 

lAECRGEWWELYHPCLAA (SEQ ID NO: 192); 

TWYCEDTRTGWAWSCLEL (SEQ ID NO: 193); 

QLDCTWLSSGELVWCSDW (SEQ ID NO: 194); 
5 QFDCTWLSSGELVWCSDW (SEQ ID NO: 195); 

CWPPVCEVSGICS (SEQ ID NO: 196); 

CGCSLSPIQLQHLC (SEQ ID NO: 197); 

CGCHVSPVQIKALC (SEQ ID NO: 198); 

GCHVSPVQIKALC (SEQ ID NO: 199); 
10 GTSCAAPATSWFQYCVLP (SEQ ID NO: 200); 

RMDCASKLHECSLRCAYA (SEQ ID NO: 201); 

GVVCEPMDSNGIVQCSMR (SEQ ID NO: 202); 

IDVCQYASAADEQRCLRI (SEQ ID NO: 203); 

NVLCEYWDEPSLSWCLSS (SEQ ID NO: 204); 
15 CQCERECFQMLERC (SEQ ID NO: 205); 

FCSCGMSTDELDEICAIW (SEQ ID NO: 206); 

EEVCYVSPSTGLYSCYDQ (SEQ ID NO: 207); 

LLDICELKLQECARRCN (SEQ ID NO: 208); 

GGGLLDICELKLQECARRCN (SEQ ID NO: 209); 
20 GRTGGGLLDICELKLQECARRGN (SEQ ID NO: 2 1 0); 

LGIEGRTGGGLLDICELKLQECARRCN (SEQ ID NO: 21 1); 

LLDICELKLQEAARRCN (SEQ ID NO: 212); and 

KLLDICELKLQEAARRCN (SEQ ID NO: 213). 

25 Particularly preferred formula (V) sequences are selected from the group consisting of: 
LLDICELKLQECARRCN (SEQ ID NO: 208); 
GGGLLDICELKLQECARRCN (SEQ ID NO: 209); 
GRTGGGLLDICELKLQECARRGN (SEQ ID NO: 2 1 0); 
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LGIEGRTGGGLLDICELKLQECARRCN (SEQ ID NO: 21 1); 
LLDICELKLQEAARRCN (SEQ ID NO: 212); and 
KLLDICELKLQEAARRCN (SEQ ID NO: 2 1 3). 

5 F. Compounds OF Formula (VI): 

In another aspect, compounds are provided comprising a peptide chain 
approximately 9 to 40 amino acids in length that binds to G-CSFR and contains a 
sequence of amino acids of formula (VI): 

(VI) X^,X^2X^X^4X^5X\CX\X^8(SEQIDNO:6) 
10 wherein each amino acid is indicated by standard one-letter abbreviation, and wherein X^, 
is E, C, Q, V, or Y; X\ is E, A, L, M, S, W, or Q; X^3 is K, R or T; X\ is L, A, or V; X^, 
is R, A, M, H, E, V, L, G, D, Q, or S; X^^ is E or V; X^7 is A or G; X^'j is R, H, G or L. 

Preferably X^, is E; X\ is A or L; X^3 is K or R; X^4 is L; X^'g is E; X'', is A; 
and X^j is R. 

15 Examples of particularly preferred sequences satisfying formula (VI) include, 

but are not limited to, the following: 

EEKLRECAR (SEQ ID NO: 214); 

EARLAECAR (SEQ ID NO: 215); 

CMKLMECAR (SEQ ID NO: 216); 
20 ELRLRECAH(SEQIDNO:217); 

EAKLHECAR (SEQ ID NO: 2 1 8); 

ELKLAECAR (SEQ ID NO: 219); 

EARLEECAR (SEQ ID NO: 220); 

EAKLRECAR (SEQ ID NO: 221); 
25 ELRLAECAR (SEQ ID NO: 222); 

ESRLAECAR (SEQ ID NO: 223); 

EAKLVECAR (SEQ ID NO: 224); 

ESRLRECAR (SEQ ID NO: 225); 
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EAKLAECAR (SEQ ID NO: 226); 
QWRLEECAR (SEQ ID NO: 227); 
QLRLEECAR (SEQ ID NO: 228); 
ELRLEECAR (SEQ ID NO: 229); 
EAKLLECAR (SEQ ID NO: 230); 
EARAGVCAG (SEQ ID NO: 231); 
EAKAGVCAG (SEQ ID NO: 232); 
VARLEECAR (SEQ ID NO: 233); 
ELKLDECAR (SEQ ID NO: 234); 
EWRLQECAR (SEQ ID NO: 235); 
EAKLSECAR (SEQ ID NO: 236); 
EARLSECAR (SEQ ID NO: 237); 
ELKLLECAR (SEQ ID NO: 238); 
ELRLQECGR (SEQ ID NO: 239); 
EQKLAECAR (SEQ ID NO: 240); 
ELRLQECAR (SEQ ID NO: 241); 
ELKLEECAR (SEQ ID NO: 242); 
ESRLEECAR (SEQ ID NO: 243); 
EATVQECAR (SEQ ID NO: 244); 
ELKLQECAR (SEQ ID NO: 245); 
YSRLEECGR (SEQ ID NO: 246); 
ELRLRECAL (SEQ ID NO: 247); 
EARLLECAR (SEQ ID NO: 248); 
ESRLLECAR (SEQ ID NO: 249); 
VLKLEECAR (SEQ ID NO: 250); 
ESKLAECAR (SEQ ID NO: 251); 
ESKLRECAR (SEQ ID NO: 252); 
EYKLGECAR (SEQ ID NO: 253); 
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ESRLQECAR (SEQ ID NO: 254); 
QARLAECAR (SEQ ID NO: 255); 
ELKKQECAR (SEQ ID NO: 256); 
ESRLSECAR (SEQ ID NO: 257); 
EARLEECGR (SEQ ID NO: 258); 
ESRLAECGR (SEQ ID NO: 259); 
EWRLEECAR (SEQ ID NO: 260); 
EARLSECGR (SEQ ID NO: 261); 
AARLAECAR (SEQ ID NO: 262); 
EWKLAECAR (SEQ ID NO: 263); 
ESKLEECAR (SEQ ID NO: 264); 
DVKLAECAR (SEQ ID NO: 265); 
ELQLEECAR (SEQ ID NO: 266); and 
EYKLASCAR (SEQ ID NO: 267). 



Other preferred formula (VI) sequences include but are not limited to, the following: 
RLSICEEKLRECARGC (SEQ ID NO: 268); 
PLTTCEARLAECARQL (SEQ ID NO: 269); 
LALCMKXMECARRY (SEQ ID NO: 270); 
ELVMCELRLRECAHRA (SEQ ID NO: 271); 
PLARCEAKLHECARQL (SEQ ID NO: 272); 
LLSVCELKLAECARSK (SEQ ID NO: 273); 
RLEWCEARLEECARRC (SEQ ID NO: 274); 
RLRVVEAKLRECARGR (SEQ ID NO: 275); 
CVAHLELRLAECARQI (SEQ ID NO: 276); 
HLARCESRLAECARQL (SEQ ID NO: 277); 
RLALLEAKLVECARRL (SEQ ID NO: 278); 
DLFSLESRLRECARRV (SEQ ID NO: 279); 
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AVPVLEAKIAECARRF (SEQ ID NO: 280); 
YLQQLQWRLEECARGM (SEQ ID NO: 281); 
YLELCQLRLEECARQFN (SEQ ID NO: 282); 
ELHICELRLEECARGR (SEQ ID NO: 283); 
RVARCELRLAECARKS (SEQ ID NO: 284); 
YLEVLESRLAECARWK (SEQ ID NO: 285); 
EAKLLECARAR (SEQ ID NO: 286); 
ELSLCEARAGVCAGSVTK (SEQ ID NO: 287); 
ELSLCEAKAGVCAGSVTK (SEQ ID NO: 288); 
ALWQCVARLEECARSR (SEQ ID NO: 289); 
CLKSCELKLDECARRM (SEQ ID NO: 290); 
ALQTCEWRLQECARSR (SEQ ID NO: 291); 
YISQCEAKLAECARLY (SEQ ID NO: 292); 
ELSSCEAKLSECARRW (SEQ ID NO: 293); 
ELSSCEARLSECARRW (SEQ ID NO: 294); 
QLLQCELKLLECARQG (SEQ ID NO: 295); 
ELLRCEARLAECARGC (SEQ ID NO: 296); 
QLRQCELRLQECGRHGN (SEQ ID NO: 297); 
PLTSCEQBCLAECARRF (SEQ ID NO: 298); 
LLGMCELRLQECARAK (SEQ ID NO: 299); 
ELSRCELKLEECARGM (SEQ ID NO: 300); 
DCRPCESRLEECARRL (SEQ ID NO: 301); 
RLSVCEARLEECARQL (SEQ ID NO: 302); 
PLKMCEATVQECARLI (SEQ ID NO: 303); 
LLLFCEARLSECARHV (SEQ ID NO: 304); 
SLSMCEARLAECARLL (SEQ ID NO: 305); 
PLFSCELKLQECARRCN (SEQ ID NO: 306); 
SLERCYSRLEECGRRI (SEQ ID NO: 307); 
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PLTSCELRLRECALRSN (SEQ ID NO: 308); 
KLAACELKLAECARRW (SEQ ID NO: 309); 
KLAACELRLAECARRW (SEQ ID NO: 310); 
ALTRCELRLAECARKI (SEQ ID NO: 311); 
LLQQCELKLAECARSI (SEQ ID NO: 312); 
QLWQCEARLLECARRS (SEQ ID NO: 3 1 3); 
RLRLCESRLLECARSL (SEQ ID NO: 314); 
QLETCVLKLEECARRCN (SEQ ID NO: 315); 
ALSQCELRLAECARSVTK (SEQ ID NO: 316); 
ELKLAECARRS (SEQ ID NO: 3 1 7); 
ALSRCESKLAECARRQ (SEQ ID NO: 318); 
LMSTCESKLRECARSL (SEQ ID NO: 319); 
SLQRCEYKLGECARSL (SEQ ID NO: 320); 
RLELLESRLQECARQLN (SEQ ID NO: 321); 
QMEWCQARLAECARCCN (SEQ ID NO: 322); 
PLFSCELKKQECARRCN (SEQ ID NO: 323); 
LLDKCESRLSECARRL (SEQ ID NO: 324); 
LLARCEARLEECGRQC (SEQ ID NO: 325); 
DLLYCESRLAECGRM (SEQ ID NO: 326); 
ALQMCEWRLEECARRL (SEQ ID NO: 327); 
LLTMCEARLSECGRRL (SEQ ID NO: 328); 
ALWRCESRLAECARRS (SEQ ID NO: 329); 
LLATCAARLAECARQL (SEQ ID NO: 330); 
LQTCEWKLAECARSN (SEQ ID NO: 331); 
PLRSCESKLEECARQL (SEQ ID NO: 332); 
CLRALDVKLAECARHL (SEQ ID NO: 333); 
RLKTLELQLEECARRS (SEQ ID NO: 334); 
KLRDVELKLAECARRS (SEQ ID NO: 335); 
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SLQRCEYKLASCARSL (SEQ ID NO: 336); 
RLARCELRLAECARKS (SEQ ID NO: 337); 
DLWYLESKLEECARRCN (SEQ ID NO: 338); 
DLWYLESKLEECARRANG (SEQ ID NO: 339); 
5 DLWYLESKLEECARRCNG (SEQ ID NO: 340); 

KQRELELKLAECARRS (SEQ ID NO: 341); 
QMQEWCARLAECARCCN (SEQ ID NO: 342); and 
LLDICELKLQECARRAN (SEQ ID NO: 343). 

10 A particularly preferred sequence of formula (VI) is: 

LLDICELKLQECARRAN (SEQ ID NO: 343). 

G. Compounds OF Formula (VII): 

In another aspect, the invention provides compounds comprising a peptide chain 
15 approximately 10 to 40 amino acids in length that binds to G-CSFR and contains a 
sequence of amino acids of formula (VII): 

(VII) X^,X''\X'\X^,X'\EX'\X'\X\X'^^ (SEQ ID NO: 7) 
wherein each amino acid is indicated by standard one-letter abbreviation, and wherein X^^'i 
is A, E or G; X^2 is E, H or D; is R or G; X^'^ is K, Y, M, N, Q, R, D, I, S or E; X^'5 
20 is A, S or P; X''^ is E, D, T, Q, K or A: X^\ is R, W, K, L, S, A or Q; X\ is R or E; and 
X^'g is W, G, or R. 

Preferably X^\ is A; X^^ is E; X^j is R; X^'5 is A; X^^, is E; X^V is R; X^^j is 
R; and X"^^ is W. 

Examples of particularly preferred sequences satisfying formula (VII) include, 
25 but are not limited to, the following: 

AERKAEERRW (SEQ ID NO: 344); 
AERYAEEREG (SEQ ID NO: 345); 
AERMAEERRW (SEQ ID NO: 346); 
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AERKAEERRR (SEQ ID NO: 347); 

AHRNAEERRW (SEQ ID NO: 348); 

AERKSEDWRW (SEQ ID NO: 349); 

AERKAEEKRR (SEQ ID NO: 350); 
5 AERQAETRRW (SEQ ID NO: 351); 

AERNAEERRW (SEQ ID NO: 352); 

AERQAEERRW (SEQ ID NO: 353); 

AERRAEERRW (SEQ ID NO: 354); 

AERDAEQRRW (SEQ ID NO: 355); 
10 AERIAEERRW (SEQ ID NO: 356); 

AERSAEERRW (SEQ ID NO: 357); 

AERKAEELRW (SEQ ID NO: 358); 

AERKAEESRW (SEQ ID NO: 359); 

EERKAEERRW (SEQ ID NO: 360); 
15 ADGKAEERRW (SEQ ID NO: 361); 

ADGKAEELRW (SEQ ID NO: 362); 

ADGMPEERRW (SEQ ID NO: 363); 

ADGEAEKRRW (SEQ ID NO: 364); 

ADGNAEERRW (SEQ ID NO: 365); 
20 ADGEAEKARW (SEQ ID NO: 366); 

AEGEAEKARW (SEQ ID NO: 367); 

GERKAEERRW (SEQ ID NO: 368); 

AEREAEERRW (SEQ ID NO: 369); 

ADGEAEARRW (SEQ ID NO: 370); 
25 ADGRAEEARW (SEQ ID NO: 37 1 ); 

AEGRAEEARW (SEQ ID NO: 372); 

AEREAEKARW (SEQ ID NO: 373); 

AERKAEEQRW (SEQ ID NO: 374); 
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AERDAEKRRW (SEQ ID NO: 375); and 
AEREAEKLRW (SEQ ID NO: 376). 



Other preferred formula (VI) sequences include but are not limited to, the following: 
5 MLAERKAEERRWFNTHGRE (SEQ ID NO: 377); 

MLAERKAEERRWFNTHGREK (SEQ ID NO: 378); 

GGGMLAERKAEERRWFNTHGRE (SEQ ID NO: 379); 

CMLAERKAEERRWFNTHGRE (SEQ ID NO: 380); 

CMLAERKAEERRWFNTHGREK (SEQ ID NO: 381); 
10 MLAERYAEEREGFNMQWRE (SEQ ID NO: 382); 

MLAERMAEERRWFRRMG (SEQ ID NO: 383); 

IVAERKAEERRRLNTEGHE (SEQ ID NO: 384); 

ILAHRNAEERRWFQKHGR (SEQ ID NO: 385); 

MLAERKSEDWRWLKTHGRD (SEQ ID NO: 386); 
15 MLAERKAEEKRRLKTQGRE (SEQ ID NO: 387); 

ILAERQAETRRWMRNAGSVTK (SEQ ID NO: 388); 

MLAERNAEERRWLKRQCG (SEQ ID NO: 389); 

MLAERQAEERRWLKMHGGE (SEQ ID NO: 390); 

MLAERRAEERRWLKTQGGD (SEQ ID NO: 391 ); 
20 MLAERQAEERRWLKTQGRD (SEQ ID NO: 392); 

MLAERKAEERRWFKTHGRE (SEQ ID NO: 393); 

MLAERKAEERRWFNNQGRE (SEQ ID NO: 394); 

MPAERDAEQRRWLKTHGRE (SEQ ID NO: 395); 

ILAERIAEERRWLKTQGR (SEQ ID NO: 396); 
25 MLAERKAEERRWLQTHGRE (SEQ ID NO: 397); 

ILAERSAEERRWLKTQGRE (SEQ ID NO: 398); 

LLAERKAEELRWLKTHGRE (SEQ ID NO: 399); 

MLAERKAEERRWLQTHGRE (SEQ ID NO: 400); 
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10 



%■ 15 



20 



MLAERNAEERRW (SEQ ID NO: 401); 
MFAERKAEESRWLQSQGRE (SEQ ID NO: 402); 
MLEERKAEERRWLKTHGR (SEQ ID NO: 403); 
MLAERKAEERRWLKMQGRE (SEQ ID NO: 404); 
MLAERNAEERRWFYTHGRE (SEQ ID NO: 405); 
MLADGKAEERRWLKTHGLD (SEQ ID NO: 406); 
MIADGKAEERRWLKTHGRD (SEQ ID NO: 407); 
MLADGKAEELRWLKTQGSD (SEQ ID NO: 408); 
MLAERNAEERRWLKTHGRD (SEQ ID NO: 409); 
MLADGKAEELRWLKTQGRE (SEQ ID NO: 410); 
ILADGKAEERRWLKTHGRD (SEQ ID NO: 411); 
MLADGMPEERRWLQTHGRD (SEQ ID NO: 412); 
MLADGEAEKRRWLNTHGRD (SEQ ID NO: 413); 
MLADGNAEERRWLMTHGRD (SEQ ID NO: 414); 
MLADGEAEKARWLKTQGRE (SEQ ID NO: 415); 
MLAEGEAEKARWLKTQGRE (SEQ ID NO: 416); 
MLADGKAEERRWLKTQGRE (SEQ ID NO: 417); 
MLAERKAEERRWLSAHVRE (SEQ ID NO: 418); 
LLGERKAEERRWYKTHARE (SEQ ID NO: 419); 
MLAERKAEERRWLMTHGHD (SEQ ID NO: 420); 
MLAERKAEERRWLKSQCLE (SEQ ID NO: 421); 
LLAEREAEERRWFKTHGRE (SEQ ID NO: 422); 
MLADGEAEARRWFNMHGRE (SEQ ID NO: 423); 
MLADGRAEEARWLKTQGSE (SEQ ID NO: 424); 
MLAEGRAEEARWLKTQGSE (SEQ ID NO: 425); 
MLAEREAEKARWLKTQGRE (SEQ ID NO: 426); 
MMAERKAEEQRWFDIHGRD (SEQ ID NO: 427); 
LTAERDAEKRRWLLTHGGE (SEQ ID NO: 428); 
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MLAERQAEERRWLKSQRGE (SEQ ID NO: 429); 
LLAERKAEERRWFATHGRD (SEQ ID NO: 430); 
MLAEREAEKXRWLKSQERA (SEQ ID NO: 43 1 ); 
MLAERKAEERRWLKTHGGE (SEQ ID NO: 432); 
5 KGGGMLAERKAEERRWFNTHGRE (SEQ ID NO: 490); and 

KSTGGLTAERDAEKRRWLLTHGGE (SEQ ID NO: 491). 

H. Other Active Compounds 

In another aspect of the invention, there are provided additional compounds 
10 comprising a peptide chain approximately 5 to 40 amino acids in length that binds to G- 
CSFR and contains a sequence of amino acids selected from the following compounds: 

CTWTDLESVY (SEQ ID NO: 433); 

HTTNEQFFMC (SEQ ID NO: 434); 

DTWLELESRY (SEQ ID NO: 435); 
15 HNSSPMVGVT (SEQ ID NO: 436); 

DWQKTIPAYW (SEQ ID NO: 437); 

RWGREGLVAALL (SEQ ID NO: 438); 

WSGTRVWRCVVT (SEQ ID NO: 439); 

MSLLSYLRS (SEQ ID NO: 440); 
20 LDLLAI (SEQ ID NO: 441); 

RIYGVK (SEQ ID NO: 442); 

MIWHMFMSLLF (SEQ ID NO: 443); 

FFWASWMHLLW (SEQ ID NO: 444); 

FDDCWREREQFLFQAL (SEQ ID NO: 445); 
25 CGRASECFRLLEM (SEQ ID NO: 446); 

RECFQMLER (SEQ ID NO: 447); 

CSIRWDFVPGYGLC (SEQ ID NO: 448); 

WMQCWDSLSLCYDM (SEQ ID NO: 449); 
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ALLMGESKLAECARAR (SEQ ID NO: 450); 

L AHCKKRKEECAAG (SEQ ID NO: 45 1 ); 

SIDGVYLRTSRT (SEQ ID NO: 452); 

SIDGVYLRTRSRTRY (SEQ ID NO: 453); 
5 VRWLRGSTLRGLRDR(SEQIDNO:454); 

DRGGGTVGVYWWESY (SEQ ID NO: 455); 

VWGTVGTWLEY (SEQ ID NO: 456); 

LMWVSAY (SEQ ID NO: 457); 

RASDEYGALVRFCTNL (SEQ ID NO: 458); 
5 10 NYWCDSNWVCEIA (SEQ ID NO: 459); 

m LAHCLLRLEECAAG (SEQ ID NO: 460); 

□ LALCLARLRECAGG (SEQ ID NO: 461); 
;=3 CESRLVECSRM (SEQ ID NO: 462); 

i== LLDIAELKLQECARRCN (SEQ ID NO: 463); 

□ 15 KLLDIAELKLQECCARRCN (SEQ ID NO: 464); 

CSTGGGLTAERDAEKRRWLLTHGGE (SEQ ID NO: 465) 
g LTAERDAEKRRWLLTHGGEGG (SEQ ID NO: 466); 

Q LTAERDAEKRRWLLTHGGEGGK (SEQ ID NO: 467); 

LTAERDAEKRRWLLTHGGEGGGGG (SEQ ID NO: 468); 
20 LTAERDAEKRRWLLTHGGEGGGGGK (SEQ ID NO: 469); 

ESGWVW (SEQ ID NO: 470); 

NSGWVW (SEQ ID NO: 471); 

SGWVW (SEQ ID NO: 472); 

PLGKCEATCREMARYFN (SEQ ID NO: 473); 
25 SLQRCEYKLASVRGLCN (SEQ ID NO: 474) 

DLWYLESKLEEAARRCNG (SEQ ID NO: 475); 

PYMGTRSRAKLLRQQ (SEQ ID NO: 476); 

RNAGERRWFKTQGWY (SEQ ID NO: 477); 
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MLAERNADDRRWFNTHGRD (SEQ ID NO: 478); 
MMADGRLRNSVGLILWCD (SEQ ID NO: 479); 
MLADGRLRNVVG (SEQ ID NO: 480); 
LLADVRRRNGVGLLRMGRD (SEQ ID NO: 481); 
5 MLADGRLRNFGG (SEQ ID NO: 482); 

TYMTYVYWLC (SEQ ID NO: 483); 
RFGERWGL (SEQ ID NO: 484); 
HWLWWGWNF (SEQ ID NO: 485); 
RECFQMLERC (SEQ ID NO: 486); 
10 ILAHRNABCERRWFQKHGR (SEQ ID NO: 487); and 

CSTGGGLTAERDAEKRRWLLTHGGEK (SEQ ID NO: 489). 



Particularly preferred sequences are selected from the group consisting of: 
LLDIAELKLQECARRCN (SEQ ID NO: 463); and 
15 KLLDIAELKLQECCARRCN (SEQ ID NO: 464). 



I. Synthesis of the Peptides: 

Standard solid phase peptide synthesis techniques are preferred for synthesis of 
the peptides of the present invention. Such techniques are described, for example, by 

20 Merrifield (1963) J. Am. Chem. Soc. 85:2149. As is well known in the art, solid phase 
synthesis using the Merrifield method involves successive coupling of a-amino protected 
amino acids to a growing support-bound peptide chain. After the initial coupling of a 
protected amino acid to a resin support (e.g., a polystyrene resin, a chloromethylated resin, 
a hydroxymethyl resin, a benzhydrylamine resin, or the like, depending on the chemistry 

25 used), the a-amino protecting group is removed by a choice of reagents, depending on the 
specific protecting group. Suitable a-amino protecting groups are those known to be 
useful in the art of stepwise synthesis of peptides. Included are acyl type protecting 
groups (e.g., formyl, trifluoroacetyl, acetyl), aromatic urethane type protecting groups 
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(e.g., benzyloxycarbonyl (Cbz) and substituted Cbz), aliphatic urethane protecting groups 
(e.g., t-butyloxycarbonyl (Boc), isopropyloxycarbonyl, cyclohexyloxycarbonyl), alkyl type 
protecting groups (e.g., benzyl, triphenylmethyl), fluorenylmethyl oxycarbonyl (Fmoc), 
alloxycarbonyl (Alloc) and Dde. The side chain protecting groups (typically ethers, 
5 esters, trityl, and the like) remain intact during coupling; however, the side chain 

protecting group must be removable upon completion of the synthesis of the final peptide. 
Preferred side chain protecting groups, as v^ill appreciated by those skilled in the art, will 
depend on the particular amino acid that is being protected as well as the overall chemistry 
used. After removal of the «-amino protecting group, the remaining protected amino acids 

10 are coupled stepwise in the desired order. Each protected amino acid is generally reacted 
in about a 3-fold excess using an appropriate carboxyl group activator such as 
2-(lH-benzotriazol-l-yl)-l,l,3,3 tetramethyluronium hexafluorophosphate (HBTU) or 
dicyclohexylcarbodiimide (DCC) in solution, for example, in methylene chloride 
(CH2CI2), N-methyl pyrrolidone, dimethyl formamide (DMF), or mixtures thereof 

15 Once the synthesis is complete, the compound is cleaved from the solid support 

by treatment with a reagent such as trifluoroacetic acid, preferably in combination with a 
scavenger such as ethanedithiol, P-mercaptoethanol or thioanisole. The cleavage reagent 
not only cleaves the peptide from the resin, but also cleaves all remaining side chain 
protecting groups. 

20 These procedures can also be used to synthesize peptides containing amino 

acids other than the 20 naturally occurring, genetically encoded amino acids. For instance, 
naphthylalanine can be substituted for tryptophan, with 1-Nal or 2-Nal. Other synthetic 
amino acids that can be substituted into the peptides of the present invention include, but 
are not limited to, nor-leucine and 3-pyridylalanine. 



25 
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III. Variation and Modification of the Compounds 

A. DiMER Forms (With a Terminal Linking Moiety): 

The compounds of the present invention may be in the form of a dimer, i.e., a 
compound comprised of two similar (but not necessarily identical) peptide sequences. 
5 Preferably, the dimer compounds of the invention have the structure of formula (VIII) 



10 



(VIII) 



/ 

(Lk)x 
\ 



(pA), 



n4" 



■(PA), 



n2 



(Lk)y 



(pA)n3- 



"(PA)ni 



wherein R^, nl, n2, n3, n4, x, y and Lk are defined as follows. 

R^ is a peptide chain that binds to G-CSFR and contains a sequence of amino 
acids of the present invention. R^ is also a peptide chain that binds to G-CSFR and 
15 contains a sequence of amino acids of the present invention. As previously indicated, R^ 
and R^ can be the same or different. It is preferred, however, that R^ and R^ are the same. 

PA is a p-alanine residue and may or may not be present, meaning that nl , n2, 
n3 and n4 are independently zero or 1 . 

Lk is a terminal linking moiety. If the dimer contains only one linking moiety, 
20 one of x and y is zero and the other is one. Alternatively, if the dimer contains two linking 
moieties, both x and y are one. Thus, x and y are independently zero or one with the 
proviso that the sum of x and y is either one or two. 

The terminal linking moiety Lk can be any moiety recognized by those skilled in 
the art as suitable for joining the peptides of R^ and R^. Lk is preferably although not 
25 necessarily selected from the group consisting of a disulfide bond, a carbonyl moiety and a 
Ci.12 linking moiety optionally terminated with one or two -NH- linkages and optionally 
substituted at one or more available carbon atoms with a lower alkyl substituent. 
Preferably, the terminal linking moiety comprises -NH-R^-NH- wherein R^ is lower (C^g) 



Atty Dkt 0300-00 r 
AffymaxNo. 2095 
PATENT 





alkylene substituted with a functional group such as a carboxyl group or an amino group 
that enables coupling to another molecular moiety (e.g., as may be present on the surface 
of a solid support), and is optionally substituted with a lower alkyl group. Optimally, the 
linking moiety is a lysine residue or lysine amide, i.e., a lysine residue wherein the 
5 carboxyl group has been converted to an amide moiety -CONHj. 



NH2-EQSNSGWVWVGGGGC-CONH2 (SEQ ID NO: 101) 



NH2-EQSNSGWVWVGGGGC-CONH2 (SEQ ID NO: 101); 



10 



CSTGGGLTAERDAEKRRWLLTHGGE (SEQ ID NO: 465) 
(isTGGGLTAERDAEKRRWLLTHGGE (SEQ ID NO: 489); 



15 



MLAERKAEERRWFNTHGRE (SEQ ID NO: 377) 



MLAERKAEERRWFNTHGRE-K(NH2) (SEQ ID NO: 378); 



20 



CMLAERKAEERRWFNTHGRE (SEQ ID NO: 380) 
CMLAERKAEERRWFNTHGRE^K (SEQ ID NO: 381); 



25 



LTAERDAEKRRWLLTHGGEGG (SEQ ID NO: 466) 
LTAERDAEKRRWLLTHGGEGG-i (SEQ ID NO: 467); and 



LTAERDAEKRRWLLTHGGEGGGGG (SEQ ID NO: 468) 
LTAERDAEKRRWLLTHGGEGGGGG-K (SEQ ID NO: 469). 



30 
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5 



n 15 

\ u 

20 



25 



30 



B. Disulfide Bonds: 

When a pair of cysteine residues is present in a peptide of the invention, it is 



preferred that the pair form a disulfide bond linking these residues. The disulfide bond 
may be present within a single peptide chain forming an intramolecular disulfide bond. 
Alternatively, if the compound includes an additional cysteine-containing peptide chain, 
the disulfide bond may connect the two chains. In addition, where an additional pair of 
cysteine residues exists in the compound, more than one disulfide bond may be present. 



Disulfide bond formation may be effected by techniques well known to those 



skilled in the art. One such technique involves employing a suitable oxidizing reagent 
such that a disulfide bond forms from the free thiols from a pair of cysteine residues. 
Undesired disulfide bond formation can be minimized, for example, by protecting the thiol 
groups of those cysteine residues not intended to form disulfide bonds and oxidizing the 
peptide before removal of any protecting groups. Preferred compounds having disulfide 
bonds include, by way of example, the following: 



NH2-STAERLWFCG-CONH2 (SEQ ID NO: 135) 
NH2-STAERLWFCG-CONH2 (SEQ ID NO: 135); 

NH2-QSNSEREWFC-CONH2 (SEQ ID NO: 138) 
NH2-QSNSEREWFC-CONH2 (SEQ ID NO: 138); 

NH2-QSNSEREWFCG-CONH2 (SEQ ID NO: 149) 
NH2-QSNSEREWFCG-CONH2 (SEQ ID NO: 149); 

[H]-DLWYLESKLEECARRANG-[NH2] (SEQ ID NO: 339) 
[H]-DLWYLESKLEECARRANG-[NH2] (SEQ ID NO: 339); 
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[H]-DLWYLESKLEEAARRCNG -[NHj] (SEQ ID NO: 475) 
[H]-DLWYLESKLEEAARRiNG-[NH2] (SEQ ID NO: 475); 

[H]-DLWYLESKLEECARRCNG -[NHj] (SEQ ID NO: 340); 



[H]-LLDICELKLQECARRAN-[OH] (SEQ ID NO: 343); 



[H]-LLDICELKLQEAARRCN-[OH] (SEQ ID NO: 212); 
10 I I 

[H]-K-LLDICELKLQEAARRCN-[OH] (SEQ ID NO: 231); 
[Biotin] 

[H]-LLDIAELKLQECARRCN-[OH] (SEQ ID NO: 463); 
15 I I 

[H]-KLLDIAELKLQECARRCN-[OH] (SEQ ID NO: 464); and 



NH3^-LLDICELKLQECARRCN-C0O (SEQ ID NO: 208) 
20 III 

NHs^-LLDICELKLQECARRCN-COO (SEQ ID NO: 208). 

A particularly preferred compound having disulfide bonds includes 

25 NH3^-LLDICELKLQECARRCN-C0O (SEQ ID NO: 208) 

1 11 
NHj^-LLDICELKLQECARRCN-COO (SEQ ID NO: 208). 



30 C. N-Terminal Modifications: 

(i) PEGylated Compounds 

The peptides and compounds of the invention can advantageously be modified 
with or covalently coupled to one or more of a variety of hydrophilic polymers. It has 
been found that when the peptide compounds are derivatized with a hydrophilic polymer, 
35 their solubility and circulation half-lives are increased and their immvinogenicity is 
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masked. Quite surprisingly, the foregoing can be accomplished with little, if any^ 
diminishment in binding activity. Nonproteinaceous polymers suitable for use in 
accordance with the present invention include, but are not limited to, polyalkylethers as 
exemplified by polyethylene glycol and polypropylene glycol, polylactic acid, polyglycolic 
5 acid, polyoxyalkenes, polyvinylalcohol, polyvinylpyrrolidone, cellulose and cellulose 
derivatives, dextran and dextran derivatives, etc. Generally, such hydrophilic polymers 
have an average molecular weight ranging from about 500 to about 100,000 daltons, more 
preferably from about 2,000 to about 60,000 daltons and, even more preferably, from 
about 5,000 to about 50,000 daltons. In preferred embodiments, such hydrophilic 

10 polymers have average molecular weights of about 5,000 daltons, 10,000 daltons 20,000 
daltons and 40,000 daltons. 

The peptide compounds of the invention can be derivatized with or coupled to 
such polymers using any of the methods set forth in Zallipsky (1995) Bioconjugate Chem, 
6:150-165; Monfardini et al. (1995) Bioconjugate Chem, 6:62-69; U.S. Patent No. 

15 4,640,835; U.S. Patent No. 4,496,689; U.S. Patent No. 4,301,144; U.S. Patent No. 
4,670,417; U.S. Patent No. 4,791,192; U.S. Patent No. 4,179,337 or WO 95/34326. 

In a preferred embodiment, the N-terminus of a peptide of the invention is 
coupled to a polyethylene glycol molecule. It is particularly preferred that the polymer is 
selected from the group consisting of polyethylene glycol, polypropylene glycol, polylactic 

20 acid, polyglycolic acid and derivatives and combinations thereof. Most preferably the 
polymer is polyethylene glycol (PEG), in which case the peptide is referred to as 
"PEGylated." PEG is a linear, water-soluble polymer of ethylene oxide repeating units 
with two terminal hydroxyl groups. PEGs are classified by their molecular weights which 
typically range from about 500 daltons to about 40,000 daltons. In a presently preferred 

25 embodiment, the PEGs employed have an average molecular weight of from about 500 to 
about 80,000 daltons. It is particularly preferred that the polymer has an average 
molecular weight of between about 5,000 to 40,000 daltons. 
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The PEG coupled to the peptide compounds of the invention can be either 
branched or unbranched. (See, e.g. Monfardini et aL (1995) Bioconjugate Chem. 6:62- 
69.) PEG is commercially available from Shearwater Polymers, Inc. (Huntsville, 
Alabama), Sigma Chemical Co. and other companies. Suitable PEGs include, but are not 
5 limited to, monomethoxypolyethylene glycol (MePEG-OH), monomethoxypolyethylene 
glycol-succinate (MePEG-S), monomethoxypolyethylene glycol-succinimidyl succinate 
(MePEG-S-NHS), monomethoxypolyethylene glycol-amine (MePEG-NI^), 
monomethoxypolyethylene glycol-tresylate (MePEG-TRES) and 
monomethoxypolyethylene glycol-imidazolyl-carbonyl (MePEG-IM). 

10 Briefly, in one exemplary embodiment, the hydrophilic polymer v^hich is 

employed, e.g., PEG, is capped at one terminus by an unreactive group such as a methoxy 
or ethoxy group. Thereafter, the polymer is activated at the other terminus by reaction 
with a suitable activating agent, such as a cyanuric halide (e.g., cyanuric chloride, bromide 
or fluoride), diimidazole, an anhydride reagent (e.g., a dihalosuccinic anhydride, such as 

15 dibromosuccinic anhydride), acyl azide,/7-diazoniumbenzyl ether, 

3-(/?-diazoniumphenoxy)-2-hydroxypropylether, or the like. The activated polymer is then 
reacted with a peptide compound of the invention to produce a polymer-derivatized 
peptide compound. Alternatively, a functional group in the peptide compounds of the 
invention can be activated for reaction with the polymer, or two groups can be joined in a 

20 concerted coupling reaction using known coupling methods. It will be readily appreciated 
that the peptide compounds of the invention can be derivatized with PEG using a myriad 
of other reaction schemes known to those of skill in the art. 

(ii) ACETYLATED COMPOUNDS 

In some instances, the N-terminus of the peptide is acetylated. Preferred 
25 acetylated compounds include, by way of example, the following: 
AC-ESGWVW-CONH2 (SEQ ID NO: 470); 
AC-NSGWVW-CONH2 (SEQ ID NO: 471); and 
AC-SGWVW-CONH2 (SEQ ID NO: 472). 
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The peptides and compounds of the invention can be modified with an acetyl 
moiety (Ac) using standard techniques known to those skilled in the art. One such 
technique includes combining the peptide with an acetylating reagent (e.g., acetyl chloride, 
acetic anhydride) in a suitable solvent to form the acetylated product. To the extent that 
5 other acetylated products are formed during the reaction, the N-terminus derivative can be 
isolated using conventional separation techniques. 

D, C-Terminal Modifications: 

The peptides and compounds of the invention can advantageously be modified 
10 to include an amide fimctionality at the carboxyl terminus of the peptide. Thus, it is 
preferred that the C-terminus of the peptide is amidated. 

In preparing peptides wherein the C-terminus carboxyl group is replaced by the 
amide -C(0)NR^R'^ where and R"* are independently H or lower (Cj.^) alkyl, a 
benzhydrylamine resin is preferably used as the solid support for peptide synthesis. Upon 
15 completion of the synthesis, a hydrogen fluoride treatment is employed to release the 
peptide from the support, directly resulting in the free peptide amide (i.e., the C-terminus 
is -C(0)NH2). Alternatively, use of a chloromethylated resin during peptide synthesis 
coupled with reaction with ammonia (to cleave the side chain protected peptide from the 
support) yields the fi*ee peptide amide and reaction with an alkylamine or a dialkylamine 
20 yields a side chain protected alkylamide or dialkylamide (i.e., the C-terminus is 

-C(0)NR^R'^ where R^ and R"^ are as defined above). Side chain protecting groups are then 
removed in the usual fashion by treatment with hydrogen fluoride to give the free amides, 
alkylamides, or dialkylamides. 

25 E. Other Modifications: 

One can also replace the naturally occurring side chains of the 20 genetically 
encoded amino acids (or the stereoisomeric D amino acids) with other side chains, for 
instance with groups such as alkyl, lower alkyl, cyclic 4-, 5-, 6- or 7-membered alkyl, 
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amide, amide lower alkyi, amide di(lower alkyl), lower alkoxy, hydroxy^ carboxy and the 
lower ester derivatives thereof, and 4-, 5-, 6- or 7-membered heterocyclic. In particular, 
proline analogues in which the ring size of the proline residue is changed from 5 members 
to 4, 6, or 7 members can be employed. 
5 One can also readily modify the peptides herein by phosphorylation or other 

methods as described in Hruby et al. (1990) Biochem J. 268 :249-262. Thus, the peptides 
of the invention also serve as structural models for non-peptidic compounds with similar 
biological activity. For example, the peptide backbones may be replaced with a backbone 
composed of phosphonates, amidates, carbamates, sulfonamides, secondary amines, and 
10 N-methylamino acids. 

IV. Utility 

The compounds of the invention are useful in vitro as unique tools for 
understanding the biological role of G-CSF, including the evaluation of the many factors 

15 thought to influence, and be influenced by, the production of white blood cells. The 
present compounds are also useful in the development of other compounds that bind to 
G-CSFR, because the compounds provide important structure-activity relationship (SAR) 
information that facilitates that development. 

Moreover, based on the ability to bind to G-CSFR and related receptors, a 

20 compound of the invention can be used as a reagent for detecting a G-CSF receptor or 
related receptor on living cells, fixed cells, in biological fluids, in tissue homogenates, in 
purified, natural biological materials, etc. For example, by labeling a compound of the 
invention, one can identify a cell expressing G-CSFR on its surface. In addition, based on 
it ability to bind a G-CSFR, a compoxmd of the invention can be used in in situ staining, 

25 FACS (fluorescence-activated cell sorting), Westem blotting, ELISA (enzyme-linked 
immxmoadsorptive assay), etc. In addition, because of its ability to bind to a G-CSFR, a 
compound of the invention can be used in receptor purification or in purifying cells 
expressing G-CSFR on the cell surface (or inside permeabilized cells). 
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A compound of the invention can also be utilized as a commercial research 
reagent for various medical research and diagnostic uses. Such uses include but are not 
limited to: (1) use as a calibration standard for quantitating the activities of candidate 
G-CSFR antagonists or agonists in a variety of functional assays; (2) use as a blocking 
5 reagent in random peptide screening, i.e., in searching for new families of G-CSFR 
peptide ligands; (3) use in the co-crystallization with G-CSFR, i.e., a compound of the 
invention will allow formation of crystals bound to G-CSFR, enabling the determination 
of receptor/peptide structure x-ray crystallography; (4) use in inhibiting or decreasing the 
proliferation and growth of G-CSF-dependent cell lines; and (5) other research and 

10 diagnostic applications wherein the action of G-CSFR is to be mimicked, and the like. 

A compound of the invention can also be administered to a warm blooded 
animal, including a human, to treat a disease that would benefit from the ability of a 
compound to mimic the effects of G-CSF in vivo. Thus, the present invention 
encompasses methods for treating a patient who would benefit from a G-CSFR modulator, 

15 comprising administering to the patient a therapeutically effective amount of a compound 
of the invention to activate G-CSFR. For example, a compound of this invention will find 
use in the treatment of diseases such as a depressed neutrophil count. Although 
attributable to a myriad of causes, a depressed neutrophil count is commonly associated 
with chemotherapy, AIDS and pneumonia (particularly community-acquired pneumonia). 

20 Thus, it is preferred that a compound of the present invention be used to treat a depressed 
neutrophil count selected from the group consisting of chemotherapy-induced neutropenia, 
AIDS-induced neutropenia and community-acquired pneumonia-induced neutropenia. 

In addition, the invention encompasses methods for treating a patient who 
would benefit from a G-CSFR modulator, comprising administering to the patient a 

25 therapeutically effective amount of a compound of the invention that antagonizes the 
action of G-CSF to the G-CSFR in vivo. For example, these receptor antagonists are 
administered prior to and during chemotherapy to confer chemoprotection to the 
neutrophil progenitor cells by preventing their proliferation in the presence of cytotoxic 
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drugs. Once chemotherapy administration is suspended, the administration of the 
chemoprotective G-CSFR antagonists is also suspended thereby allowing the patient's 
endogenous G-CSF to stimulate proliferation. Alternatively, the neutrophil progenitor 
cells may be "rescued" by administration of G-CSF or by a G-CSF agonist, e.g., a 
5 compound of the present invention having G-CSF agonist activity. 

Accordingly, the invention includes pharmaceutical compositions comprising, 
as an active ingredient, at least one of the compounds of the invention in association with 
a pharmaceutical carrier or diluent. The composition can be administered by oral, 
parenteral (intramuscular, intraperitoneal, intravenous (IV) or subcutaneous) injection, 
10 transdermal (either passively or using iontophoresis or electroporation), or transmucosal 
9 (nasal, vaginal, rectal, or sublingual) routes of administration, or using bioerodible inserts, 

and can be formulated in dosage forms appropriate for each route of administration. 

Solid dosage forms for oral administration include capsules, tablets, pills, 
powders, and granules. In such solid dosage forms, the active compound is admixed with 
i;3 15 at least one inert pharmaceutically acceptable carrier such as sucrose, lactose, or starch. 

Such dosage forms can also comprise, as is normal practice, an additional substance other 
^3 than an inert diluent, e.g., a lubricating agent such as magnesium stearate. In the case of 

capsules, tablets, and pills, the dosage forms may also comprise a buffering agent. Tablets 
and pills can additionally be prepared with enteric coatings. 
20 Liquid dosage forms for oral administration include pharmaceutically acceptable 

emulsions, solutions, suspensions and syrups, with the elixirs containing an inert diluent 
commonly used in the art, such as water. These compositions can also include one or 
more adjuvants, such as a wetting agent, an emulsifying agent, a suspending agent, a 
sweetening agent, a flavoring agent or a perfuming agent. 
25 Preparations for parenteral administration include sterile aqueous or 

non-aqueous solutions, suspensions, and emulsions. Examples of non-aqueous solvents or 
vehicles are propylene glycol, polyethylene glycol, vegetable oils, such as olive oil and 
com oil, gelatin, and injectable organic esters such as ethyl oleate. Such dosage forms 
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may also contain one or more adjuvants such as a preserving agent, a wetting agent, an 
emulsifying agent and a dispersing agent. The dosage forms may be sterilized by, for 
example, filtration through a bacteria-retaining filter, by incorporating sterilizing agents 
into the compositions, by irradiating the compositions, or by heating the compositions. 
5 They can also be manufactured using sterile v^ater, or some other sterile injectable 
medium, prior to use. 

Compositions for rectal or vaginal administration are preferably suppositories 
which may contain, in addition to the active substance, an excipient such as cocoa butter 
or a suppository wax. Compositions for nasal or sublingual administration are also 
10 prepared with one or more standard excipients well known in the art. 

The dosage of active ingredient in the compositions of this invention may be 
varied; however, it is necessary that the amount of the active ingredient is such that a 
suitable dosage form is obtained. The selected dosage depends upon the desired 
therapeutic effect, the route of administration, the duration of the treatment desired, and 
15 other factors well known to those skilled in the art. Generally, dosage levels of between 
0.001 to 10 mg/kg of body weight daily are administered to mammals. 

It is to be understood that while the invention has been described in conjunction 
with the preferred specific embodiments thereof, that the foregoing description as well as 
the examples which follow are intended to illustrate and not limit the scope of the 
20 invention. Other aspects, advantages and modifications within the scope of the invention 
will be apparent to those skilled in the art to which the invention pertains. 

All patents, patent applications, and publications mentioned herein are hereby 
incorporated by reference in their entirety. 



Atty DktOSOO-OOJ 
AffymaxNo. 2095 
PATENT 



-43- 



EXPERIMENTAL 



The following examples are put forth so as to provide those of ordinary skill in 
the art with a complete disclosure and description of how to prepare and use the 
5 compounds disclosed and claimed herein. Efforts have been made to ensure accuracy with 
respect to numbers (e.g., amoimts^ temperature, etc.) but some errors and deviations 
should be accounted for. Unless indicated otherwise, parts are parts by weight, 
temperature is in °C and pressure is at or near atmospheric. 

Standard peptide synthetic methods were used, and solid phase reactions were 
10 carried out at room temperature. Unless otherwise indicated, all starting materials and 
reagents were obtained commercially, e.g., from Aldrich, Sigma and ICN, and used 
without further purification. Standard cell culture and cell harvesting procedures were 
used. 

Also, in these examples and throughout this specification, the abbreviations 
15 employed have their generally accepted meanings, as follows: 
Ac = acetyl 

BSA = bovine serum albumin 
DMSO = dimethyl sulfoxide 
DTT = dithiothreitol 
20 HPLC = high pressure liquid chromatography 

MB? = maltose binding protein 
PBS phosphate-buffered saline 

SDS PAGE = sodium dodecyl sulfate polyacrylamide gel electrophoresis 
TCEP = tris(2-carboxyethyl) phsophine 
25 TFA = trifluoroacetic acid 

Tris = tris[hydroxymethyl]aminomethane 



30 
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EXAMPLES 1-34 

G-CSF Competition Binding Assays 
The peptides of Table 1 were synthesized using standard techniques and were 
subsequently evaluated to identify whether the peptides exhibited specific and/or 
5 competitive binding. 

Specific binding is binding of a Ugand to a specific receptor, as opposed to 
non-specific binding that is mediated by non-specific interactions. Specific binding may 
be measured by subtraction of the non-specific binding (measured in the presence of 
saturating concentrations of unlabeled ligand) from the total binding (measured in the 
10 absence of saturating amounts of ligand). Typically, the unlabeled ligand used was a 
variant of G-CSF in which the cysteine normally found at position 17 was converted to 
serine (CS 17). 

Determination of competitive binding was also carried out for a number of 
peptides. Briefly stated, G-CSFR was purified using standard techniques. The receptor 

15 was then immobilized in microtiter plate wells that were coated with acid-treated 
antibody (Abl 79) specific for a site on G-CSFR not involved with G-CSF binding. 
Separately, ^^^I was coupled to the natural ligand G-CSF using techniques well known in 
the art. Test peptides were added to receptor-coated wells and allowed to bind to 
immobilized receptor for approximately 30 minutes. ^^^I labeled G-CSF was then 

20 introduced to the wells and incubated overnight at 4 ''C. Unbound labeled G-CSF was 
removed by washing the plate several times followed by measuring the amount of 
radioactivity that remained in each well using conventional techniques. If no reduction in 
the amount of bound labeled G-CSF was detected, the peptide did not compete for 
binding to the receptor. Alternatively, if reduced amounts or no ^^^I labeled G-CSF was 

25 detected, the peptide did compete. Non-positive results for a particular peptide are not 
dispositive of that peptide's activity: the peptide may exhibit binding under conditions 
different fi*om those tested. 
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The results of these assays reveal important information about the structure 
activity relationship for peptide and peptide mimetics of the invention to the G-CSF 
receptor. 

Table 1 



5 


Ex. 
No. 


Sequence 


Specific 
Binding ? 


Competitive 
Binding ? 




1 


CAGEVMHMCC (SEQ ID NO: 8) 


Yes 


Yes 




2 


CNREIEAMCC (SEQ ID NO: 9) 


Yes 


Yes 




3 


CADEVMHFCC (SEQ ID NO: 10) 


Yes 


Yes 


;f! . ' 10 


4 


CDVWQLFDRC (SEQ ID NO: 25) 


Yes 


Yes 


y 


5 


CSFVQLNSIC (SEQ ID NO: 26) 


Yes 


Yes 


i ' 


6 


CVPWMFYDLC (SEQ ID NO: 29) 


Yes 


No 




7 


CDPWMFYDLC (SEQ ID NO: 30) 


Yes 


No 




8 


CQRAGYMLAC (SEQ ID NO: 44) 


No 


No 


0 15 


9 


CHANPVWGEC (SEQ ID NO: 45) 


No 


No 




10 


CTWTDLESVY (SEQ ID NO: 433) 


No 


No 


5^1 ■ 


11 


CFWSDWGQTC (SEQ ID NO: 46) 


No 


No 




12 


CPDWYQSYMC (SEQ ID NO: 34) 


Yes 


Yes 




13 


CPHWTSYYMC (SEQ ID NO: 47) 


Yes 


Yes 


20 


14 


CACMLRVVHC (SEQ ID NO: 43) 


Yes 


Yes 




15 


CETLCGACFC (SEQ ID NO: 44) 


No 


No 




16 


SNESGWVWLP (SEQ ID NO: 110) 


Yes 


No 




17 


EQSNSGWVWV (SEQ ID NO: 1 1 1) 


Yes 


No 




18 


SRTESGWVWT (SEQ ID NO: 1 12) 


Yes 


No 


25 


19 


QRANSGWVWV (SEQ ID NO: 113) 


Yes 


No 
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20 


DYDNSGWVWH (SEQ ID NO: 1 14) 


Yes 


No 


21 


ETWGERDWFC (SEQ ID NO: 133) 


Yes 


Yes 


22 


STAERLWFCG (SEQ ID NO: 135) 


Yes 


Yes 


23 


YETAERSYFC (SEQ ID NO: 1 19) 


Yes 


Yes 


24 


ADNAERGWFC (SEQ ID NO: 137) 


Yes 


Yes 


25 


QSNSEREWFC (SEQ ID NO: 138) 


Yes 


Yes 


26 


STSERAWFCG (SEQ ID NO: 139) 


Yes 


Yes 


27 


ASWSERGWFC (SEQ ID NO: 140) 


Yes 


Yes 


28 


ELSSEREWFC (SEQ ID NO: 141) 


Yes 


Yes 


29 


DMQGERGWFC (SEQ ID NO: 142) 


Yes 


Yes 


30 


DMVYAYPPWS (SEQ ID NO: 1 55) 


Yes 


No 


31 


DEMVYTVPYW (SEQ ID NO: 156) 


Yes 


Yes 


32 


HTTNEQFFMC (SEQ ID NO: 434 ) 


Yes 


Yes 


33 


DTWLELESRY (SEQ ID NO: 435) 


Yes 


No 


34 


DWQKTIPAYW (SEQ ID NO: 437) 


Yes 


Yes 



Examples 35-73 

G-CSF Radioligand Binding Assays 

The peptides of Table 2 were synthesized using standard techniques and were 
20 subsequently evaluated to determine their binding affinities to G-CSFR. 

Streptavidin-coated scintillation proximity assay (SPA) beads (Amersham) were 
mixed with biotinylated anti-receptor immobilizing antibody (Abl79) followed by 
incubation with soluble G-CSFR harvest. Receptor-coated SPA beads were washed twice 
in PBS /0.1% BSA and distributed to wells of a white polystyrene 96-well microtiter 
25 plate (Packard). Serial dilutions of peptide or peptide mimetic were mixed with a 
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constant amount of ^^'I labeled G-CSF (lO^cpm; 1290 Ci/mmol) in PBS/0.1% BSA, 
added to wells containing receptor-coated SPA beads, and incubated overnight at 4°C. 
The binding of radiolabeled G-CSF to the receptor-coated SPA bead brings the isotope in 
close proximity to the scintillant, which allows the emitted radiation to stimulate the 
5 scintillant to emit light. Any unbound radiolabeled ligand is not in close enough 

proximity to the scintillant to allow such energy transfer and hence no signal is generated. 
The amount of ^^^I labeled G-CSF that was bound at equilibrium was measured by 
counting the plate in a TopCount (Wallac) microtiter plate luminometer. The assay is 
conducted over a range of peptide concentrations and the results are graphed such that the 

' 10 y-axis represents the amount of bound ^^^I labeled G-CSF and the x-axis represents the 

|«n concentration of peptide or peptide mimetic. One can determine the concentration at 

Q which the peptide or peptide mimetic will reduce by 50% (IC50) the amount of ^^^I labeled 

G-CSF bound to immobilized G-CSFR. The dissociation constant (K^) for the peptide 

'^^ should be similar to the measured IC50 using the assay conditions described above. 

□ 15 The peptides along with their corresponding IC50 values are shown in Table 2. 

fU IC50 values are indicated symbolically by the symbols and For examples, 

those peptides which showed IC50 values in excess of 200 uM are indicated with a 

^^3 Those peptides which gave IC50 values of less than or equal to 200 uM are given a 

while those which gave IC50 values of 500 nM or less are indicated with a "++". Those 
20 peptides, which gave IC50 values at or near the cutoff point for a particular symbol, are 
indicated with a hybrid designator, e.g., "+/-". The peptides for which IC50 values were 
not determined are listed as "N.D.". 

The results of these assays reveal important information about the structure- 
activity relationship for peptide and peptide mimetics of the invention to the G-CSF 
25 receptor. 
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Table 2 



Ex. 
No. 


Sequence 


IC,o 


35 


NH2-EQSNSGWVWV-CONH2 (SEQ ID NO: 1 1 1) 




36 


NH2-STAERLWFCG-CONH2 (SEQ ID NO: 135) 




37 


NH2-STAERLWFCG-CONH2 (SEQ ID NO: 135) 
NH2-STAERLWF(:G-C0NH2 (SEQ id NO: 1 3 5) 


+ 


JO 


>JH -OWSFREWFC-rONH-, TSEO ID NO' 138) 




jy 


MfT -DSNSFRFWFC-rONH. (SEO ID NO: 138Y 
NHj-QSNSEREWpi-CONHj (SEQ ID NO: 1 38) 




40 


NH2-QSNSEREWFCG-CONH2 (SEQ ID NO: 149) 




41 


NH2-QSNSEREWFCG-CONH2 (SEQ ID NO: 149) 
NH2-QSNSEREWFCG-CONH2 (ISEQ ID NU. 14yj 




42 


AC-ESGWVW-CONH2 (SbQ lU NU: 4 /U) 




43 


AC-NSGWVW-CONH2 (SEQ ID NO: 471) 


- 


44 


AC-SGWVW-CONH2 (J!>EQ ID NU: 4/z) 








+ 


46 


NH2-EQSNSGWVWVGGGGC-CONH2 (SEQ ID NO: 101) 
NH2-EQSNSGWVWVGGGGi-CONH2 (SEQ ID NO: 101) 


+ 


47 


CESRLVECSRM (SEQ ID NO: 462) 


+/- 


48 


LAHCLLRLEECAAG (SEQ ID NO: 460) 


+/- 


49 


ALLMCESKLAECARAR (SEQ ID NO: 450) 


+/- 


50 


DLWYLESKLEECARRANG (SEQ ID NO: 339) 
DLWYLESKLEEiARRANG (SEQ ID NO: 339) 


+ 


51 


DLWYLESKLEECARRCNG (SEQ ID NO: 340) 
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52 


DLWYLESKLEEAARRCNG (SEQ ID NO: 475) 
DLWYLESKLEEAARRCNG (SEQ ID NO: 475) 


+ 


53 


LLDICELKLQECARRCN (SEQ ID NO: 208) 


-H- 


54 


GGGLLDICELKLQECARRCN (SEQ ID NO: 209) 


++ 


55 


GRTGGGLLDICELKLQECARRCN (SEQ ID NO: 210) 


++ 


56 


LGIEGRTGGGLLDICELKLQECARRCN (SEQ ID NO: 211) 


++ 


57 


LLDICELKLQECARRAN (SEQ ID NO: 343) 


+ 


58 


LLDICELKLQEAARRCN (SEQ ID NO: 212) 


+ 


59 


Biotin-LLDICELKLQEGARRAN (SEQ ID NO: 343) 


+ 


60 


Biotin-KLLDICELKLQEAARRCN (SEQ ID NO: 213) 


+ 


61 


LLDIAELKLQECARRCN (SEQ ID NO: 463) 


+ 


62 


Biotin-KLLDIAELKLQECARRCN (SEQ ID NO: 464) 


+ 


63 


Biotin-KGGGMLAERKAEERRWFNTHGRE 

(SEQ ID NO: 490) 


+ 


64 


MLAERKAEERRWFNTHGRE (SEQ ID NO: 377) 
MLAERKAEERRWFNTHGREK (SEQ ID NO: 378) 


+/- 


65 


CMLAERKAEERRWFNTHGRE (SEQ ID NO: 3 80) 

1 \ 
CMLAERKAEERRWFNTHGREK (SEQ ID NO: 381) 


N.D. 


66 


H2N-KSTGGLTAERDAEKRRWLLTHGGE-COOH 

(SEQ ID NO: 491) 




67 


CSTGGGLTAERDAEKRRWLLTHGGE (SEQ ID NO: 465) 
tsTGGGLTAERDAEKRRWLLTHGGE (SEQ ID NO: 465) 


+ 


68 


LTAERDAEKRRWLLTHGGEGG (SEQ ID NO: 466) 
LTAERDAEKRRWLLTHGGEGgL: (SEQ id NO: 467) 




69 


LTAERDAEKRRWLLTHGGEGGGGG (SEQ ID NO: 468) 
LTAERDAEKRRWLLTHGGEGGGGgJc (SEQ ID NO: 469) 
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70 


YLELCQLRLEECARQFN (SEQ ID NO: 282) 


+ 


71 


CGCHVSPVQIKALC (SEQ ID NO: 198) 


+ 


72 


GCHVSPVQIKALC (SEQ ID NO: 199) 




73 


HELCETYADWLGCVEW (SEQ ID NO: 76) 


N.D. 



5 



EXAMPLES 74-81 

Cell Proliferation and Luminescence Assays 
The bioactivity of selected peptides of the invention was measured in cell-based 
assays. Murine NFS-60 cells proliferate in the presence of G-CSF in a dose dependent 

10 manner and were used in standard cell proliferation assays that are well known in the art. 
Murine IL-3 dependent Ba/F3 cells were co-transfected with expression vectors encoding 
the full length human G-CSFR and a luciferase reporter gene controlled by the fos 
promoter. The Ba/F3 G-CSFR reporter cell line is not only dependent on the presence of 
G-CSF for proliferation, but also produces luciferase in response to the addition of 

15 G-CSF in a dose dependent manner. The parental, untransfected cell line does not 
respond to G-CSF or produce luciferase, but remains IL-3 dependent. 

Reporter cell assays were performed on the above cell line using peptides of the 
invention. The cells were maintained in complete RPMI-1640 media containing 10% 
fetal calf serum, 2 mM L-glutamine, IX antibiotic-antimycotic solution (Life 

20 Technologies), and 10% WEHI-3 conditioned media (source of murine IL-3). For 

reporter assays, cells were starved overnight in medium which lacks WEHI-3 to reduce 
luciferase expression to background levels. The cells were then washed twice in PBS, 
resuspended in media which lacks WEHI-3 conditioned media, and added to wells of a 
96-well microtiter plate containing dilutions of peptide or G-CSF at 5 x 10"^ cells/well. 

25 Plates were incubated for 2 hours at 37 *^C in a humidified 5% CO2 incubator and 
luciferase activity was measured by the addition of luciferin (LucLite - Packard 
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Biosciences) to each well. The plates were read in a TopCount (Wallac) microtiter plate 
luminometer. 

To measure the ability of selected peptides of the invention to block G-CSF 
mediated receptor activation, dilutions of peptide were combined with Ba/F3 G-CSFR 
5 reporter cells as described above. After a 30-minute incubation at 37 "C, G-CSF was 
added to each well. The cells were incubated for 2 hours at 37 "C and the amount of 
luciferase produced was measured as described above. 

The following seven peptides were tested for bioactivity: 

NH2-EQSNSGWVWV-CONH2 (SEQ ID NO: 1 1 1); 
NH2-STAERLWFCG-CONH2 (SEQ ID NO: 135); 
NH2-STAERLWFCG-CONH2 (SEQ ID NO: 135); 
NH2-STAERLWFCG-CONH2 (SEQ ID NO: 135); 
QLETCVLKLEECARRCN (SEQ ID NO: 315); 
LLDICELKLQECARRCN (SEQ ID NO: 208); 
PLFSCELKKQECARRCN (SEQ ID NO: 323); and 
DLWYLESKLEECARRCN (SEQ ID NO: 338). 

20 Examples 74, 75, and 76 showed antagonist activity at high concentrations in 

cell-based assays using NFS-60 cells. The stability of Example 74 in cell culture medium 
was tested by overnight incubation inNFS-60-conditioned medium; no loss of activity 
was observed, indicating that the peptide is stable to degradation under these conditions. 
Examples 77, 78, 79, and 80 showed cell proliferation activity when fused to the 

25 carboxy-terminus of the maltose binding protein (MBP). The MBP fusion protein of 
Example 78 in particular showed high affinity in a binding competition assay with 
^25j_QcSp (jQ^^ ^yp^ activity in a Ba/F3 G-CSFR cell proHferation assay 
(maximal activity at 100 nM). Parental Ba/F3 cells and Ba/F3 cells expressing the human 



;S 10 Ex.74 

S Ex. 75 

Q Ex. 76 

''^ 15 Ex.77 

:J3 Ex. 78 

i°y Ex.79 

S Ex. 80 
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thrombopoietin receptor did not proliferate in response to this fusion protein. Western 
blot analysis of the fusion protein revealed both monomeric and dimeric species, however 
the G-CSFR preferentially binds the dimeric molecule. This is true for most of the MBP 
fusions tested. Presumably the fusion protein is dimerized through intermolecular 
5 disulfide bonds between cysteine residues present in the peptide sequence. Cleavage of 
the peptide from the carboxy terminus of MBP using Factor Xa caused the peptide to lose 
its bioactivity while retaining its binding activity. 

The Ba/F3 G-CSFR reporter cell line was used to measure the potency of: 
Ex. 8 1 LLDICELKLQECARRCN (SEQ ID NO: 208) 

10 and other possible G-CSF receptor antagonists. 

Ligand mediated G-CSF receptor activation in these cells results in the expression 
of luciferase, providing a detectable biological signal. Ba/F3 G-CSFR reporter cells 
responded to the addition of G-CSF in a dose dependent manner (Figure 2). The addition 
of increasing concentrations of peptide from Example 81 inhibit this G-CSF response, 
15 indicating that the peptide is a G-CSFR antagonist (Figure 3). 

Example 82 

Characterization of the Dimer Form of AF15846 

The peptide AF15846, i.e., LLDICELKLQECARRCN (SEQ ID NO: 208), was 
under study as a G-CSF antagonist for chemoprotection against chemotherapy-induced 

20 neutropenia. The peptide monomer contains three Cys residues with a mass of 2020.4 
(average). This peptide is not active as a monomer but must be oxidized, putatively to a 
dimer form, for activity. 
Monomer vs. dimer forms of AF15846: 

AF 15846 that had been oxidized in 50 mM Tris, pH 8.0 for 48 hours was diluted 

25 with PBS, then injected onto a Superdex peptide gel filtration column equilibrated in PBS 
at 0.75 mL/min. The results of this chromatography indicated that most of the peptide 
was in dimer form, with small amounts of monomer remaining (not shown). In contrast. 
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5 



10 



•v.rJ 




20 



25 



AF15846 that had been stored in acid and then diluted with PBS directly prior to injection 
onto the peptide column eluted predominantly as a monomer. Some dimerization 
apparently occurred either during storage or during the short period the peptide was at 
neutral pH prior to and during size exclusion chromatography. Oxidized peptide also 
eluted much later from a cation exchange column run in salt gradients at low pH, 
consistent with dimer formation (not shown). 
Reverse phase HPLC assay for oxidation of AF15846: 

AF15846 was oxidized by incubation in 50 mM Tris, pH 8.0, for 16 to 48 hours. 
Reverse phase HPLC methods using a Vydac 25 cm C-18 column and 0.1% 
TFA/acetonitrile buffers were developed to separate the oxidized dimer from unoxidized 
monomer, and to separate several different dimerized peptide structures. While both high 
pH reverse phase and cation exchange chromatography were also investigated, low pH 
reverse phase separation on a 25 cm column provided the best separation of the many 
oxidized forms of the peptide (not shown). The dimer species elute from the column with 
earlier retention times than do the monomer species. Samples of oxidized AFl 5846 were 
re-reduced with DTT to confirm the elution order. One additional piece of evidence for 
the formation of intermolecular dimers comes from the fact that when oxidation was 
carried out at low (0.25 mg/mL) concentrations of peptide, the reaction apparently did not 
go to completion. 

Oxidation of AF15846 under various conditions: 

AF15846 was incubated for 48 hours in 50 mM Tris, pH 8, 20% DMSO in water, 
20 mM potassium phosphate, pH 3, or 0.1% TFA at room temperature. Aliquots of each 
sample were taken at various time points. Oxidation of the monomer peptide in Tris 
resulted in the presence of one major plus one minor oxidized species after several hours. 
In contrast, oxidation of the peptide in 20% DMSO in water resulted in a complex 
mixture of oxidized species, even after the 48 hour incubation. Some oxidation of the 
peptide was observed even at acidic pH, although to a much lesser extent than that 
observed with either Tris or DMSO as the oxidant. 



Atty Dkt 0300-00 
Affymax No. 2095 
PATENT 



m 



-54- 



Activity of oxidized AF15846 fractions; 

Several fractions containing oxidized AF15846 resulting from treatment under the 
conditions described above were collected subjected to testing in two assays: an 
^^^I-G-CSF competition binding assay and an ELISA format competitive G-CSF 
5 receptor-binding assay. In both cases fractions corresponding to the predominant 

Tris-oxidized species exhibited the highest activity. The activity of selected fractions in 
the ^^^I-G-CSF competition binding assay is shown in Figure 4. While species 
corresponding to the monomer peptide were inactive, matrix-assisted laser 
desorption/ionization mass spectrometry (MALDI-MS) confirmed that the active, Tris- 
10 oxidized species was a peptide dimer. 

Determination of the disulfide structure of the active oxidized form of AF15846: 

It was hypothesized that the active form of AF 1 5846 would contain one intrachain 
disulfide per peptide monomer and one interchain peptide dimer. The three possibilities 
for this type of structure are shown below 
15 H3N^-LLDICELKLQECARRCN-COa (SEQ ID NO: 208) 



20 



25 



30 



35 



H3N^-LLDICELKLQECARRCN-COa (SEQ ID NO: 208); 
H3N"'-LLDICELKLQECARRCN-COO- (SEQ ID NO: 208) 



H3N^-LLDICELKLQECARRCN-COO (SEQ ID NO; 208); and 
H3N^-LLDICELKLQECARRCN-COO (SEQ ID NO: 208) 



H3N^-LLDICELKLQECARRCN-COa (SEQ ID NO: 208). 

To determine if one of these structures was present in the active form of AF15846, 
aliquots of Tris-oxidized AF15846 (not HPLC purified) were digested with trypsin and 
subjected to reverse phase HPLC. Trypsin digestion was carried out using an 
immobilized enzyme column from Perseptive Biosystems. Digestion was carried out in 
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20 



25 mM Tris, pH 8, 5 mM CaCla- Fractions were eluted from the column directly into 
0. 1% TFA to lower the pH and minimize disulfide scrambling. The resulting tryptic 
fragments were separated by reverse phase HPLC and analyzed by MALDI mass 
spectrometry and Edman sequencing. In addition, an aliquot of the digest was analyzed 
by electrospray liquid chromatography/mass spectrometry (LC/MS). MALDI MS and 
sequencing of the tryptic peptides indicated the presence of peptides corresponding to 
disulfide bonds between Cys-5 and Cys-5, as well as between Cysl2 and Cys-12. This 
finding indicated that there were two interchain disulfide bonds between peptide 
monomers. This result was confirmed by the LC/MS data (Figure 5), which identified 
peptides identical to those found by MALDI MS. The typtic peptides are labeled, 
beginning with the first residue, i.e., Lys, as follows: Tl = residues 1-8; T2 ^ residues 
9-14; Tl,2 = residues 1-14; T2,3 = residues 9-15; and indicates a disulfide linkage 
between peptides. However, an additional minor species was evidently present, as a 
peptide corresponding to a disulfide bond between Cys-5 and Cys-12, which could be 
either an intrachain or an interchain disulfide, was also seen, albeit at a lower level- 
To confirm that the active species contained at least two interchain disulfides, an 
aliquot of the HPLC-purified, Tris-oxidized AF 15846 shown to be active in competition 
assays was also digested with trypsin. The profile of the purified material was compared 
to that of the unfractionated Tris oxidation product (Figure 6, same labeling as in Figure 
5). The HPLC profile indicates that the purified material is lacking a peptide 
corresponding to a Cys-5 to Cys-12 disulfide-linked fragment. This indicated that the 
active species contains two interchain disulfide bonds. However, the oxidation state of the 
remaining Cys-16 in each monomer was not determined. 

The oxidized peptide was also reacted with N-ethylmaleimide (NEM) at 37 ^'C for 
1 hour in 100 mM ammonium acetate, pH 4.1 to see if any free Cys residues remained in 
the molecule. If this were the case, treatment with the alkylating reagent would result in a 
shift of the HPLC retention time. Upon incubation with NEM, no such shift was seen 
(Figure 7). In contrast, when the oxidized peptide was incubated with the disulfide 
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specific reducing agent TCEP, also in ammonium acetate, a shift to a later retention time, 
consistent with reduced peptide, was found. The reduced peptide was modified with 
NEM to produce a peptide that eluted even later than the reduced form. These data 
indicate that all six Cys residues in the AF15846 active dimer are involved in disulfide 
5 bonds. Since previous results showed that Cys-5 is linked to Cys-5 and Cys- 12 is linked 
to Cys- 12, it seems apparent that the remaining two Cys residues at position 16 of the 
monomer are also involved in an interchain disulfide bond. 

To obtain further information about the disxdfide bond structure in active 
AF 15846, the peptide was digested with Lys-C in 50 mM Tris pH 7.0/30% acetontrile. 

10 The profile of this digest is shown in Figure 8. Four major peaks are seen. The first peak 
corresponds to a dimer of residues 9-17, as indicated by the MALDI MS spectrum of this 
fraction. See Figvires 9 A and 9B. However, it is not possible to tell with this technique if 
all four Cys residues are involved in disulfide formation. The last peak contains a dimer 
of residues 1-8. The remaining two peaks represent intact peptide (22 min) and an 

15 artifact peak. This second digest clearly indicates that the peptide dimerizes into a 
parallel structure. 

This three parallel interchain disulfide structure, indicated below, is different than 
that originally predicted. Note that the arrows represent sites of cleavage by trypsin. 



AF15846 (dimer form) 
25 Incubation of the oxidized peptide at 37 °C at higher pH apparently resulted disulfide 

scrambling and/or degradation of the peptide as control peptide fractions incubated at pH 
6.0 or pH 7.5 in parallel with NEM-treated fractions exhibited complex HPLC patterns 
after incubation. It was necessary to drop to pH 4.1 to obtain clean profiles upon NEM 
treatment. 



20 



NH3^-LLDICELKLQECARRCN-COa (SEQ ID NO: 208) 
NH3^-LLDICELKLQECARRCN-COa (SEQ ID NO: 208) 



30 
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A bioassay for determining activity of G-CSF antagonists: 

A biosassay was used to measure the potency of AF 15846 and other possible G- 
CSF receptor antagonists. This bioassay utilizes a Ba/F3 cell line containing the rhGCSF 
receptor and a c-fos promoter/luciferase gene construct (Ba/F3/rhGCSF-R/pFos-lcf). 
Competent binding of a ligand to the receptor results in expression of lucifierase as the 
biological readout. Addition of AF 15846 to the assay results in the dose-response curve 
shifting to higher concentrations, indicating that the peptide is inhibiting the binding of 
G-CSF to the expressed receptor (Figures lOA and lOB). Conversely, the inclusion of 
various levels of peptide in the assay causes an increase in the amount of G-CSF required 
to produce a signal, also indicating that the peptide inhibits G-CSF binding (Figure 11). 



